Method for producing a nucleotide sequence construct with optimised codons for an HIV
genetic vaccine based on a primary, early HIV isolate and synthetic envelope BX08
constructs.



Field of the invention

The invention relates to a DNA vaccine against HIV, which is designed from a clinical
primary isolate. One aspect of the invention relates to a method of producing a nucleotide
sequence construct, in a preferred aspect based on a cassette system, the nucleotide
sequence construct being used as a DNA vaccine. The method can, for example, lead to the
disclosed synthetic BX08 HIV-1 envelope vaccine nucleotide sequence construct, designed
to generate suitable DNA vaccines against HIV, specifically HIV-1. Furthermore, the
invention can be used for the production of recombinant protein antigens.



Background of the invention 

There is an urgent need for new vaccine strategies against HIV. One promising new
strategy is called genetic immunisation or DNA vaccination (Webster et al 1997). Some of the
advantages of a DNA vaccine against HIV are the induction of Th cell activation, the induction
of antibodies, including antibodies against conformation-dependent epitopes, and the induction
of cellular immunity. So far, most DNA vaccine envelope genes tested have been from
tissue-culture-adapted virus strains (Boyer et al 1997), which often differ in several respects
from primary clinical isolates (such as early isolates), e.g. in co-receptor usage (Choe et al
1996, Dragic et al 1997).

One disadvantage of HIV envelope based DNA vaccines may be their intrinsically low
expression, which is regulated by Rev expression. This may prevent an optimal
investigation of the vaccines in small animal models like mice, where Rev functions
suboptimally. Recently it has been shown, using the tissue culture adapted HIV-1 MN strain,
that an exchange of the HIV codon usage to that of highly expressed mammalian genes
greatly improves the expression in mammalian cell lines and renders the HIV expression Rev
independent (Haas et al 1996). Additionally, it is known that rare codons cause pausing of
the ribosome, which leads to a failure in completing the nascent polypeptide chain and an
uncoupling of transcription and translation. Pausing of the ribosome is thought to lead to
exposure of the 3' end of the mRNA to cellular ribonucleases.

The world-wide spread of HIV-1 has presently resulted in 8,500 new infections daily, and
AIDS is now the number 1 cause of death among US males (and number 3 among US
females) aged 25-40 years. The epidemic hot-spots now include Eastern Europe, India,
South East Asia and southern Africa. The attempts to solve this world-wide problem involve
education, prevention, treatment and vaccine development. Affordable protective vaccines
represent the best solution to the world-wide problem of infection with HIV-1. Induction of
virus neutralising antibodies is one of the key components in vaccine development. Several
recombinant envelope vaccines have been tested in humans and animals; however, they
seem unable to induce sufficient protection. In this respect DNA vaccination may provide a
different and more natural mode of antigen presentation. It is hoped that the immune
responses induced by such DNA vaccines could aid in limiting virus replication, slowing
disease progression or preventing occurrence of disease. Unfortunately, many HIV envelope
vaccines induce only moderate levels of antibodies. This could in part be due to limitations in
expression, influenced by regulation by the Rev protein and by a species-specific and biased
HIV codon usage. Virus variability is also considered a barrier to the development of
antibody based vaccines, and thus a tool for easier production of closely related vaccine
variants is needed.

It has been suggested to improve the immunogenicity and antigenicity of epitopes by certain
mutations in the envelope gene. An elimination of certain immune dominant epitopes (like
V3) could render less immune dominant but more relevant, conserved, or hidden epitopes
more immunogenic (Bryder et al 1999). Elimination of certain N-linked glycosylation
sites could also improve the exposure of relevant epitopes and increase the immunogenicity of
those epitopes. Thus, it is possible that elimination of the glycosylation sites in V1 and V2
may expose neutralising epitopes in a more favourable way (Kwong et al 1998, Wyatt et al
1998). The HIV envelope contains putative internalisation sequences in the intracellular part
of gp41 (Sauter et al 1996). Thus it would be relevant to eliminate and/or mutate the
internalisation signals in a membrane bound HIV envelope vaccine gene to increase the
amount of surface exposed vaccine derived HIV glycoproteins such as gp150. Since the antibody
response, measured and calculated in titers, is improved by adding the secreted
gp120 as opposed to adding the membrane bound form (Vinner et al 1999), it could be
advantageous to express the vaccine as a secreted gp120 or a secreted gp140. This would
include important parts of gp41, such as the 2F5 neutralising linear epitope (Mascola et al
1997).

Summary of the invention

Our suggested solution to the problems described above is to design DNA envelope 
vaccines from a clinical primary isolate with Rev-independent high expression in mammals, 
that is built as a cassette for easy variant vaccine production. 


A method of producing a nucleotide sequence construct with codons from highly expressed
mammalian proteins, based on a cassette system coding for an early, primary HIV envelope,
is described. The method comprises the steps of direct cloning of an HIV gene, derived from
a patient within the first 12 months of infection, thereby obtaining a first nucleotide sequence;
designing a second nucleotide sequence utilising the most frequent codons from mammalian
highly expressed proteins to encode the same amino acid sequence as the first nucleotide
sequence; redesigning the second nucleotide sequence so that restriction enzyme sites
surround the regions of the nucleotide sequence encoding functional regions of the amino
acid sequence and so that selected restriction enzyme sites are removed, thereby obtaining
a third nucleotide sequence encoding the same amino acid sequence as the first and the
second nucleotide sequence; redesigning the third nucleotide sequence so that the terminals
contain convenient restriction enzyme sites for cloning into an expression vehicle; producing
snuts between restriction enzyme sites as well as terminal snuts; and introducing snuts into
an expression vehicle by ligation. The nucleotide sequence construct obtained by this
method uses the mammalian highly expressed codons (figure 1), renders the envelope
gene expression Rev independent, and allows easy cassette exchange of regions
surrounded by restriction enzyme sites that are important for immunogenicity, function, and
expression.

The method can, for example, lead to the disclosed synthetic, Rev-independent, clinical
(such as early), primary HIV-1 envelope vaccine gene, built as a multi cassette. From the
sequence of the envelope of the HIV-1 BX08 isolate (personal communication from Marc
Girard, Institute Pasteur, Paris), the present inventors have designed a synthetic BX08 HIV-1
envelope vaccine nucleotide sequence construct.


With the great diversity of envelopes in HIV among different patients and within one patient,
it would be advantageous to vaccinate with several envelope variants, all being highly
expressed. Rather than synthesising several full length envelopes, it is much easier to
exchange relevant parts of an envelope cassette for those of various strains in a multivalent
vaccine.

Both the disclosed synthetic BX08 nucleotide sequence construct and any other
nucleotide sequence construct obtained by the method are designed to generate
suitable DNA vaccines against HIV, specifically HIV-1. In this case the mammal, preferably a
human being, is inoculated with the nucleotide sequence construct in an expression vehicle
and constitutes a host for the transcription and translation of the nucleotide sequence
construct. The nucleotide sequence constructs of the present invention can furthermore be
used for the production of recombinant protein antigens. In this case the nucleotide
sequence construct is placed in an expression vehicle and introduced into a system (e.g. a
cell-line), allowing production of a recombinant protein with the same amino acid sequence.
The recombinant protein is then isolated and administered to the mammal, preferably a
human being. The immune system of the mammal will then direct antibodies against
epitopes on the recombinant protein. The mammal, preferably a human being, can thus be
primed or boosted with DNA and/or recombinant protein obtained by the method of the
invention.


A relevant HIV DNA vaccine can potentially be used not only as a prophylactic vaccine, but
also as a therapeutic vaccine in HIV infected patients, e.g. during antiviral therapy. An HIV
specific DNA vaccine will have the possibility to induce or re-induce the wanted specific
immunity and help the antiviral therapy in limiting or even eliminating the HIV infection. The
immunogenicity and antigenicity of epitopes in the envelope can be improved by certain
mutations in the envelope gene. The cassette system allows for easy access to the relevant
parts of the envelope gene, and thereby eases the process of genetic manipulation.
Some suggested mutations are: an elimination of certain immune dominant epitopes (like
V3); elimination of certain N-linked glycosylation sites (like glycosylation sites around V2);
elimination and/or mutation of the nucleotide sequence encoding the internalisation signals in
the cytoplasmic part of a membrane bound HIV envelope to increase the amount of surface
exposed vaccine derived HIV glycoproteins; elimination or mutation of the cleavage site
between gp120 and gp41, with mutations introduced in gp41 to preserve conformational
epitopes.


Table 1 below lists the nucleotide sequence constructs of the invention by the names used
herein, as well as by reference to relevant SEQ ID NOs of DNA sequences, and the amino
acid sequence encoded by the DNA sequence in the preferred reading frame. It should be
noted that the snut name consists of the number of the approximate position for the end of
the snut and the restriction enzyme used to cleave and/or ligate that end of the snut.

Table 1 List of names of nucleotide sequence constructs (Snuts (S) and Pieces (P))
with reference to SEQ ID NO for the nucleotide sequence and protein sequence.

Name              Nucleotide SEQ ID NO:   Protein SEQ ID NO:
So-N-Lang         1                       2
S235EcoRV         3                       4
S375PstI          5                       6
S495ClaI          7                       8
S650-720EcoRI     9                       10
S900XbaI          11                      12
S990SacI          13                      14
S1110SpeI         15                      16
S1265XhoI         17                      18
S1265gp120        19                      20
S1265gp160        21                      22
S1465PstI         23                      24
S1465PstI cys     25                      26
S1630XbaI         27                      28
S1700EagI         29                      30
S1890HindIII      31                      32
S2060SacII        33                      34
S2190ClaI         35                      36
S2330PstI         37                      38
S2425ES           39                      40
P1                41                      42
P2                43                      44
P3                45                      46
P3GV1             47                      48
P3GV1V2           49                      50
P3GV2             51                      52
P4gp160           53                      54
P4gp150           55                      56
P4gp140           57                      58
P5                59                      60
P8gp160           61                      62
P8gp150           63                      64
P8gp140           65                      66
synBX08-140       67                      68
synBX08-150       69                      70
synBX08-160       71                      72
synBX08-120       73                      74
synBX08-41        75                      76

Detailed disclosure of the invention

One aspect of the present invention relates to a method for producing a nucleotide sequence
construct coding for an HIV gene. The nucleotide sequence construct is produced as a
cassette system consisting of snuts. A snut (S) is a nucleotide sequence construct between
restriction enzyme cleavage sites, comprising the minimal entity of the cassette system.

First an HIV gene is obtained from a patient within the first 12 months of infection. The term
HIV should be understood in the broadest sense and includes HIV-1 and HIV-2. It is possible
to determine the period in which the infection has taken place with an accuracy depending
on the frequency of the blood tests taken from the patient. For example, patients suffering
from various diseases, such as lack of certain factors in their blood or hepatitis, have their
blood tested on a regular basis, making it possible to determine the period in which the
infection has taken place. Apart from patients with diseases wherein blood tests are used to
monitor the course of the disease, other groups of patients have blood tests taken, e.g. blood
donors. Unfortunately, humans are still infected due to transfer of virus in blood samples,
medical equipment, etc., making it possible to determine the date on which the infection has
taken place within a time frame of a few days. The importance of obtaining the virus early
in the course of the infection is due to the known fact that many early isolates share the
common feature of staying relatively constant in their envelope sequences (Karlsson et al.,
1998). As these early isolates may share cross-reactive antibody and/or T-cell epitopes, a
vaccine based on such early isolates would have a better chance of inducing an immune
response to shared epitopes of the virus. It is believed that an early, directly cloned virus
isolate will share immunogenic sites with other early virus isolates seen during an HIV
infection, so that if a mammal generates antibodies and/or T-cells directed against these
epitopes, the transferred virus will be eliminated prior to the extensive mutations that may
occur after approximately 12 months of infection. Thus, the virus should be isolated as early
as possible, that is within the first 12 months of infection, such as 11, 10, 9, 8, 7, 6, 5, 4, 3, 2,
1, or 0.5 month after infection.

The HIV gene for the genetic vaccine is preferably cloned directly from viral RNA or from
proviral DNA. Direct cloning in this application means that the virus is not multiplied in stable
cell lines in vitro. It is presently expected that passing the virus through a stable cell line will
promote mutation in the virus gene. It is particularly preferred not to pass the virus through
cell lines selecting for viruses with CXCR4 receptor usage. Direct cloning also includes
multiplication of virus in e.g. PBMC (peripheral blood mononuclear cells), since all virus can
multiply in PBMC, and this type of multiplication generally does not select for CXCR4
receptor usage. Multiplication of virus is often necessary prior to cloning. Preferably cloning
is performed directly on samples from the patient. In one embodiment of the invention,
cloning is performed from patient serum. The cloning is then performed directly on the HIV
virus, that is on RNA. In another embodiment of the invention, cloning is performed from
infected cells. The cloning is then performed on HIV virus incorporated in the genes in an
infected cell (e.g. a lymphocyte), that is on DNA. In the latter case the virus might be a silent
virus, that is a non-replicating virus. To evaluate whether the virus is silent, its capability of
multiplication in e.g. PBMC is tested.


Cloning is a technique well known to a person skilled in the art. A first nucleotide sequence is 
hereby obtained. In another aspect of the invention, the first nucleotide sequence, sharing 
the properties mentioned with direct cloning, is obtained by other means. This could be from 
a database of primary isolates or the like. 


Based on the first nucleotide sequence, the amino acid sequence encoded by said
nucleotide sequence is determined. A second nucleotide sequence encoding the same
amino acid sequence is then designed utilising the most frequent codons from highly
expressed proteins in mammals (e.g. figure 1 presenting the most frequent codons from
highly expressed proteins in humans).
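
The design of the second nucleotide sequence can be sketched in Python as follows. This is a minimal illustration only: the table PREFERRED_CODON is an assumed set of commonly cited human high-expression codons and is not a transcription of figure 1, and the function names translate and optimise are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# ASSUMPTION: one preferred codon per amino acid, chosen for illustration.
# Replace with the codons of figure 1 when reproducing the method.
PREFERRED_CODON = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
    "*": "TGA",
}


def translate(dna):
    """Translate an in-frame DNA sequence into one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def optimise(first_sequence):
    """Return a second sequence encoding the same protein with preferred codons."""
    protein = translate(first_sequence.upper())
    return "".join(PREFERRED_CODON[aa] for aa in protein)
```

For instance, optimise("ATGAGAGTAAAAGGGATA") replaces each codon of the input by the preferred codon for the same amino acid, so that translating the input and the output gives the same amino acid sequence.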

Presently, it appears that the usage of the most frequent codons from mammalian highly
expressed proteins has two advantages: 1) the expression is Rev independent; 2) the level
of expression is high. The Rev independence is especially advantageous when performing
experiments in mice, where the Rev system is functioning sub-optimally. For the use in
human vaccine, Rev independence and high expression are important to increase the
amount of antigen produced. The determination of the codons for high expression is in this
context based on the statistics from human highly expressed proteins (Haas, Park and Seed,
1996 hereby incorporated by reference). It is contemplated that the expression of a protein
can be even higher, when current research in binding between codon (on the mRNA) and
anticodon (on the tRNA) reveals codons with optimal binding capabilities, and when
interactions in-between codons and/or in-between anticodons are known.

The second nucleotide sequence designed utilising optimised codons is then redesigned to
obtain a third nucleotide sequence. The purpose of the redesigning is to create unique
restriction enzyme sites around the nucleotide sequence encoding functional regions of the
amino acid sequence. By having unique restriction enzyme sites around the nucleotide
sequence encoding functional regions of the amino acid sequence, the nucleotide sequence
encoding functional regions of the amino acid sequence can easily be isolated, changed, and
re-inserted. Examples of functional regions of the amino acid sequence are transmembrane
spanning regions, immunodominant regions, regions with antibody cross reacting domains,
fusion domains and other regions important for immunogenicity and expression such as
variable region 1 (V1), variable region 2 (V2), variable region 3 (V3), variable region 4 (V4)
and variable region 5 (V5).

It is important to select the restriction enzyme sites with care. When the second
nucleotide sequence is changed to insert restriction enzyme sites around the nucleotide
sequence encoding functional regions of the amino acid sequence, the third nucleotide
sequence must still code for the same amino acid sequence as the second and first
nucleotide sequences do. Thus, if necessary, the second nucleotide sequence is redesigned
by changing from optimised codons to less optimal codons. It is understood that the
restriction enzyme sites around the nucleotide sequence encoding functional regions of the
amino acid sequence should preferably be placed in the terminal region of the nucleotide
sequence encoding functional regions of the amino acid sequence. That is, preferably outside
the nucleotide sequence encoding functional regions of the amino acid sequence, such as 90
nucleotides away, e.g. 81, 72, 63, 54, 45, 36, 27, 21, 18, 15, 12, 9, 6, 3 nucleotides away, but
they could also be inside the nucleotide sequence encoding functional regions of the amino
acid sequence, such as 54, 45, 36, 27, 21, 18, 15, 12, 9, 6, 3 nucleotides inside the nucleotide
sequence encoding the functional region of the amino acid sequence.

The type of restriction enzyme sites allowed is determined by the choice of expression
vector. In certain cases, the number of restriction enzyme sites is limited and it is hard, if not
impossible, to place unique restriction enzyme sites around all the nucleotide sequences
coding for functional regions of the amino acid sequence. This problem can be solved by
dividing the entire nucleotide sequence into pieces, so that each piece comprises only
unique restriction enzyme sites. Modifications to each of the pieces are performed separately
prior to assembly of the pieces. It is preferred that the nucleotide sequence is divided into 9
pieces. In another aspect, the nucleotide sequence is divided into 8 pieces, or 7, or 6, or 5, or
4, or 3, or 2 pieces. It is especially preferred that the nucleotide sequence is divided into 3
pieces.

Thus, the redesign of the second nucleotide sequence is an interaction between the choice
of cloning vector, expression vector, selection of restriction enzyme sites, dividing into
pieces, and exchange of codons to insert restriction enzyme sites. In a preferred
embodiment of the present invention the cloning vector is Bluescript allowing the restriction
enzyme sites chosen from the group consisting of: EagI, MluI, EcoRV, PstI, ClaI, EcoRI,
XbaI, SacI, SpeI, XhoI, HindIII, SacII, NotI, BamHI, SmaI, SalI, DraI, KpnI. If other cloning
vectors are chosen, other restriction enzyme sites will be available as known by the person
skilled in the art.

As a part of the redesigning of the second nucleotide sequence, selected restriction enzyme
sites may be removed. The selected restriction enzyme sites to be removed are those sites
that are of the same type as the ones already chosen above and that are placed within
the same piece. The removal of these restriction enzyme sites is performed by changing
from optimised codons to less optimal codons, maintaining codons for the same amino acid
sequence.
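
The removal of a duplicate restriction enzyme site by a synonymous codon change can be sketched as below. This is an illustration under stated assumptions, not the procedure actually used by the inventors: the reading frame is assumed to start at index 0, the site is assumed to be palindromic (so only one strand is searched), and the helper names find_sites, remove_site and translate are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Group codons by the amino acid they encode.
SYNONYMS = {}
for codon, aa in CODON_TO_AA.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna):
    """Translate an in-frame DNA sequence (frame assumed to start at index 0)."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def find_sites(seq, site):
    """Return the 0-based start positions of every occurrence of the site."""
    return [i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)]


def remove_site(seq, site, position):
    """Destroy the site starting at `position` with one synonymous codon swap.

    Returns the modified sequence, or None if no single swap reduces the
    number of occurrences of the site.
    """
    before = len(find_sites(seq, site))
    first_codon = position // 3
    last_codon = (position + len(site) - 1) // 3
    for c in range(first_codon, last_codon + 1):
        start = 3 * c
        original = seq[start:start + 3]
        for alternative in SYNONYMS[CODON_TO_AA[original]]:
            if alternative == original:
                continue
            candidate = seq[:start] + alternative + seq[start + 3:]
            if len(find_sites(candidate, site)) < before:
                return candidate
    return None
```

In this sketch the first working synonymous codon is accepted; a practical design would additionally rank the alternatives by codon frequency so that the least harmful "less optimal" codon is chosen.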

The third nucleotide sequence is redesigned so that the terminal snuts contain convenient
restriction enzyme sites for cloning into an expression vehicle. The expression "vehicle"
means any nucleotide molecule, e.g. a DNA molecule, derived e.g. from a plasmid,
bacteriophage, or mammalian or insect virus, into which fragments of nucleic acid may be
inserted or cloned. An expression vehicle will contain one or more unique restriction enzyme
sites and may be capable of autonomous replication in a defined host or vehicle organism
such that the cloned sequence is produced. The expression vehicle is an autonomous
element capable of directing the synthesis of a protein. Examples of expression vehicles are
mammalian plasmids and viruses, tag containing vectors and viral vectors such as
adenovirus, vaccinia Ankara, adeno-associated virus, canarypox virus, Semliki Forest virus
(SFV), Modified Vaccinia Virus Ankara (MVA), and Sindbis virus. In one embodiment of the
invention, the expression vector contains tag sequences. In another embodiment of the
invention a bacterium is transformed with an expression plasmid vector and the bacterium is
then delivered to the patient. Preferred expression vehicles are Semliki Forest virus (SFV),
adenovirus and Modified Vaccinia Virus Ankara (MVA).

The snuts are produced by techniques well known by the person skilled in the art. The
preferred method for synthesising snuts is herein referred to as "the minigene approach",
wherein complementary nucleotide strands are synthesised with specific overhanging
sequences for annealing and subsequent ligation into a vector. This can be performed with
two sets of complementary nucleotide strands, or with three sets of complementary
nucleotide strands. The minigene approach minimises the known PCR errors of mismatches
and/or deletions, which may occur due to hairpins in a GC rich gene with mammalian highly
expressed codons. In figures 10-21, the production of a representative selection of snuts is
illustrated.

For the production of long snuts, that is snuts of more than about 240 nucleotides, the
technique of overlapping PCR is preferred, as illustrated in figure 8. Herein two nucleotide
strands about 130 nucleotides long with an overlap are filled in to obtain a double strand, which
is subsequently amplified by PCR.
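
The fill-in of two overlapping strands can be modelled in a few lines of Python. The sketch assumes that the two strands are given 5' to 3', that they anneal through complementary 3' ends, and that the overlap is an exact match; the function names and the min_overlap parameter are hypothetical and chosen for illustration only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA strand written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def fill_in(forward_strand, reverse_strand, min_overlap=15):
    """Return the top strand of the duplex formed by annealing and fill-in.

    The 3' end of `forward_strand` is assumed to be complementary to the
    3' end of `reverse_strand`. The longest exact overlap of at least
    `min_overlap` nucleotides is used.
    """
    reverse_as_top = reverse_complement(reverse_strand)
    longest = min(len(forward_strand), len(reverse_as_top))
    for n in range(longest, min_overlap - 1, -1):
        if forward_strand[-n:] == reverse_as_top[:n]:
            # Extend the forward strand with the part only the reverse strand covers.
            return forward_strand + reverse_as_top[n:]
    raise ValueError("strands do not share a sufficient 3' overlap")
```

With two strands of about 130 nucleotides and an overlap of about 20 nucleotides, the filled-in product is about 240 nucleotides long, which matches the snut length for which the text prefers this technique.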

For the production of multiple snuts with a length of less than about 210 nucleotides, one
preferred technique is normal PCR. In a preferred production technique the snuts are
synthesised with the same 5' flanking sequences and with the same 3' flanking sequences,
as illustrated in figure 9. The advantage of this approach is that the same PCR primer set
can be used for amplification of several different snuts.

As known by the person skilled in the art, special conditions have to be used for each
individual PCR reaction, and it should be optimised to avoid inherent problems like deletions
and mismatches when amplifying GC rich genes from synthetic ssDNA material. Whichever of
the above mentioned techniques is used, it is well known by the person skilled in the art
that it will be necessary to correct unavoidable mismatches produced by the
nucleotide strand synthesis material and/or the PCR reaction. This can be performed by site
directed mutagenesis techniques.


After the various snuts have been produced, they are assembled into pieces and 
subsequently into the complete gene. Methods for assembly (such as ligation) are well 
known by the person skilled in the art. 


In a preferred embodiment of the present invention the HIV gene encodes the entire HIV
envelope. It is understood that the HIV envelope can be the full length envelope gp160 as
well as shorter versions such as gp150, gp140, and gp120 with or without parts of gp41.

As will be known by the person skilled in the art, HIV is divided into several groups.
These groups presently include group M, group O, and group N. Further, HIV is divided
into subtypes A, B, C, D, E, F, G, H, I, and J. In the present invention subtype B is preferred
due to the high prevalence of this subtype in the Western countries.

The determination of groups and subtypes is based on the degree of nucleotide sequence
identity in the envelope gene and is presently defined as follows: if the sequence identity is
more than 90%, the viruses belong to the same subtype; if the sequence identity is between
80% and 90%, the viruses belong to the same group; if the sequence identity is less than 80%,
the viruses are considered as belonging to different groups.
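
The thresholds above can be expressed as a small classification routine. The sketch assumes that the two envelope sequences have already been aligned to equal length and that any gap character simply counts as a mismatch; treating exactly 90% as "same group" and exactly 80% as "same group" is an assumption about the boundaries, which the text does not specify.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of identical positions between two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be non-empty and of equal aligned length")
    matches = sum(x == y for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


def classify(identity):
    """Map a percent identity in the envelope gene to the relationship in the text."""
    if identity > 90:
        return "same subtype"
    if identity >= 80:  # boundary handling is an assumption
        return "same group"
    return "different groups"
```

For example, two aligned sequences that differ at one position in ten have an identity of 90% and would be reported as belonging to the same group under these boundary assumptions.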

One aspect of the invention relates to a nucleotide sequence construct in isolated form which
has a nucleotide sequence with the general formula (I), (II), (III), or (IV)

(I) P1-S495ClaI-S650-720EcoRI-P2-S1265gp120

(II) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp140

(III) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp150

(IV) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp160-S2060SacII-P5

wherein P1 designates the nucleotide sequence SEQ ID NO: 41, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90%
thereto;

wherein S495ClaI designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 95%
thereto;

wherein S650-720EcoRI designates the nucleotide sequence SEQ ID NO: 9, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 95% thereto;

wherein P2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein S1265gp120 designates the nucleotide sequence SEQ ID NO: 19, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 70% thereto;

wherein S1265XhoI designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80%
thereto;

wherein S1465PstI designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90%
thereto;

wherein P4gp140 designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein P4gp150 designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein P4gp160 designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein S2060SacII designates the nucleotide sequence SEQ ID NO: 33, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 98% thereto; and

wherein P5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto.

The design of the parent synthetic BX08 gp160 envelope cassette gene with its variant 
length genes gp150, gp140, gp120 is outlined in figure 2. 

The nucleotide sequence construct with the formula (I)

(I) P1-S495ClaI-S650-720EcoRI-P2-S1265gp120

(visualised in figure 3) (SEQ ID NO: 73) codes for the amino acid sequence of gp120 (SEQ
ID NO: 74). This amino acid sequence is the part of the HIV envelope that is secreted. Thus,
it contains the immunogenic epitopes without being bound to the cell membrane. This is of
particular advantage if the nucleotide sequence construct is used for production of
recombinant antigens or for a DNA vaccine, as the antibody immune response may be higher
to secreted versus membrane bound HIV antigens.

The nucleotide sequence construct with the formula (II)

(II) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp140

(visualised in figure 4) (SEQ ID NO: 67) codes for the amino acid sequence of gp140 (SEQ
ID NO: 68). This amino acid sequence comprises gp120 and the extracellular part of the
gp41 protein. The amino acid sequence is secreted due to the lack of the transmembrane
spanning region. This is of particular advantage if the nucleotide sequence construct is used
for production of recombinant antigens, as the immunogenic and/or antigenic epitopes in the
extracellular part of gp41 are included, and is of particular advantage for a DNA vaccine as
the antibody immune response may be higher to secreted gp120 versus membrane bound
HIV antigens.

The nucleotide sequence construct with the formula (III)

(III) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp150

(visualised in figure 5) (SEQ ID NO: 69) codes for the amino acid sequence of gp150 (SEQ
ID NO: 70). This amino acid sequence contains all of the envelope protein gp160 except the
C-terminal tyrosine-containing internalisation signals in the intracellular part of gp41. The
membrane bound surface expression of the amino acid sequence is thereby maintained and
enhanced. Mimicking of the organisation of the native epitope conformation may be expected,
making this nucleotide sequence construct of particular advantage if it is used as a vaccine.

The nucleotide sequence construct with the formula (IV)

(IV) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp160-S2060SacII-P5

(visualised in figure 6) (SEQ ID NO: 71) codes for the amino acid sequence of gp160 (SEQ
ID NO: 72) i.e. the entire envelope.

The nucleotide sequence construct designated P1 comprises the nucleotide sequence
encoding the amino acid sequence in the first variable region (V1) and the amino acid
sequence in the second variable region (V2). In one embodiment of the invention the first
variable region is surrounded by EcoRV and PstI restriction enzyme sites, and the second
variable region is surrounded by PstI and ClaI restriction enzyme sites but, as stated above,
the choice of restriction enzyme sites can vary.

The nucleotide sequence construct designated S650-720EcoRI comprises the nucleotide
sequence encoding the amino acid sequence in the third variable region (V3). In one
embodiment of the present invention S650-720EcoRI is characterised by the restriction enzyme
sites EcoRI and XbaI in the terminals.

The nucleotide sequence construct designated P2 comprises the nucleotide sequence
encoding the amino acid sequence of the fourth variable and constant region (V4 and C4). In
one embodiment of the present invention the fourth variable region is surrounded by SacI and
XhoI restriction enzyme sites.

The nucleotide sequence construct designated S1265gp120 comprises the nucleotide sequence
encoding the amino acid sequence of the fifth variable and constant region (V5 and C5).
S1265gp120 further comprises a nucleotide sequence encoding a C-terminal stop codon.

The nucleotide sequence construct designated P4gp140 comprises the nucleotide sequence
encoding the amino acid sequence of the transmembrane spanning region. P4gp140 further
comprises a nucleotide sequence encoding a C-terminal stop codon prior to the
transmembrane spanning region.

The nucleotide sequence construct designated P4gp160 comprises the nucleotide sequence
encoding the amino acid sequence of the transmembrane spanning region (transmembrane
spanning domain: TMD). In a preferred embodiment of the present invention the
transmembrane spanning region is surrounded by HindIII and SacII restriction enzyme sites.

The term "sequence identity" indicates the degree of identity between two amino acid
sequences or between two nucleotide sequences calculated by the Wilbur-Lipman alignment
method (Wilbur et al, 1983).



The nucleotide sequence constructs with the formulas (I), (II), (III), or (IV) illustrate the
flexibility of the present invention. Producing a gene with the described method enables
the production of a plethora of antigens with various immunogenic epitopes and various
advantages for production and vaccine purposes. To further illustrate the flexibility of the
invention, other changes and mutations are suggested below.



In order to improve the immunogenicity of the nucleotide sequence constructs of the
invention it is suggested to change the nucleotide sequence such that one or more
glycosylation sites are removed in the amino acid sequence. By removal of shielding
glycosylations, epitopes are revealed to the immune system of the mammal, rendering the
construct more immunogenic. The increased immunogenicity can be determined by an
improved virus neutralisation. Changes in the nucleotide sequence such that one or more
N-linked glycosylation sites are removed in the amino acid sequence are well known by the
person skilled in the art. Potential glycosylation sites are N in the amino acid sequences
N-X-T or N-X-S (wherein X is any amino acid besides P). The glycosylation site can be removed
by changing N to any other amino acid, changing X to a P, or changing T or S to any amino
acid other than T or S. It is preferred that N is changed to Q by an A to C mutation at the
first nucleotide in the codon, and a C to G mutation at the third nucleotide in the codon.
This is preferred to increase the GC content in the nucleotide sequence construct. As an
alternative N is changed to Q by an A to C mutation at the first nucleotide in the codon, and
a C to A mutation at the third nucleotide in the codon. Preferred mutations in the synthetic
BX08 envelope gene to remove potential N-linked glycosylation sites in V1 and/or V2 are
A307C + C309A and/or A325C + C327G and/or A340C + C342A and/or A385C + C387A and/or
A469C + C471A. Examples of such changes are illustrated in SEQ ID NOs: 47, 49, and 51.
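
The search for potential N-linked glycosylation sites and the preferred N to Q codon change can be sketched in Python as follows. This is a minimal illustration only: the toy sequence and the function names are assumptions and are not taken from the BX08 gene, and the sketch assumes the asparagine codon is AAC, as implied by the preferred A to C and C to G mutations.

```python
import re

# Standard genetic code, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def find_sequons(protein):
    """Return 0-based positions of N in N-X-T or N-X-S, where X is not P."""
    return [m.start() for m in re.finditer(r"(?=N[^P][ST])", protein)]


def asn_to_gln(dna, aa_index):
    """Change the AAC codon at amino acid position aa_index (0-based) to CAG."""
    start = 3 * aa_index
    if dna[start:start + 3] != "AAC":
        raise ValueError("expected an AAC (Asn) codon at this position")
    return dna[:start] + "CAG" + dna[start + 3:]


# Hypothetical toy gene fragment coding for M-N-G-T-L.
toy = "ATGAACGGCACCCTG"
sites = find_sequons(translate(toy))   # one sequon, at the N
mutated = asn_to_gln(toy, sites[0])    # AAC -> CAG (N -> Q)
```

After the change, the translated toy sequence no longer contains an N-X-T motif, while all other codons are untouched.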

For historical reasons the HIVs have been divided into syncytia inducing strains and
non-syncytia inducing strains. The assay to determine whether a strain is syncytia inducing is
described in Verrier et al 1997, hereby incorporated by reference. It is presently known that
viruses utilising the CXCR4 co-receptor are syncytia inducing strains. It is also presently
known that the binding site for CXCR4 involves the third variable region (V3). In
a preferred embodiment the nucleotide sequence construct is changed to create a binding
site for the CXCR4 co-receptor. This is presently performed in the third variable region,
preferably by the mutation G865C + A866G.

It is well established that the HIV envelope comprises immunodominant epitopes. An
immunodominant epitope is an epitope that most antibodies from the mammal are directed
against. The antibodies directed against these immunodominant epitopes may have little
effect in elimination of the virus. It is therefore anticipated that modification of the
immunodominant epitopes will induce antibodies directed against other parts of the envelope,
leading to a better elimination and neutralisation of the virus. By modification is understood
any change in the nucleotide sequence encoding an immunodominant epitope in the amino
acid sequence such that said amino acid sequence no longer contains an immunodominant
epitope. Thus, modification includes removal of the immunodominant epitope and decrease
of immunogenicity performed by mutagenesis. In a preferred embodiment of the present
invention an immunodominant epitope in the third variable region (V3) is modified, such as
deleted or altered. In a much preferred embodiment the nucleotides 793-897 are deleted.
In yet another preferred embodiment of the present invention an immunodominant epitope
has been removed from gp41, such as by deletion. This is performed in P7 or P8 by elimination
of the nucleotides 1654-1710.
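
As a small arithmetic check, which the text does not state explicitly, both deletion ranges span a whole number of codons, so the reading frame downstream of each deletion would be kept. The sketch below assumes the nucleotide ranges are inclusive; the function names are illustrative.

```python
def deletion_length(first, last):
    """Number of nucleotides in an inclusive range first..last."""
    return last - first + 1


def is_in_frame(first, last):
    """True if the deleted range is a whole number of codons."""
    return deletion_length(first, last) % 3 == 0


# Ranges quoted in the text for the V3 and gp41 deletions.
v3_deletion = (793, 897)
gp41_deletion = (1654, 1710)

v3_codons = deletion_length(*v3_deletion) // 3      # 105 nt = 35 codons
gp41_codons = deletion_length(*gp41_deletion) // 3  # 57 nt = 19 codons
```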

It is anticipated that when gp120 is dissociated from gp41 in a vaccine or antigen, two
immunodominant epitopes, one on each protein, are exposed and antibodies are directed
against these in the mammal. In the infectious virus, gp120 is coiled on top of gp41 and the
gp120/gp41 is most likely organised in a trimer, so that these immunodominant epitopes are
hidden and therefore less elimination of virus is observed. By removing the cleavage site
between gp41 and gp120 a full length gp160, gp150, or gp140 can be obtained with a
covalent binding between gp41 and gp120. Removal of the cleavage site between gp41 and
gp120 is preferably performed by the mutation C1423A. An example of such a
mutation is illustrated in the mutation of S1265XhoI (SEQ ID NO: 17) to S1265gp160 (SEQ ID NO:
21).

In order to stabilise the full length gp160, gp150, and gp140, for example when the cleavage
site between gp41 and gp120 has been removed as described above, cysteines can be
inserted, preferably inside the gp41 helix, creating disulphide bonds to stabilise a trimer of
gp41s. In a preferred embodiment of the present invention the cysteines are inserted by the
mutation 1618:CTCCAGGC:1625 to 1618:TGCTGCGG:1625. An example of such a change
is illustrated in SEQ ID NO: 25.
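
A substitution written in this coordinate notation can be applied and checked as in the following Python sketch. The surrounding toy sequence and the function names are assumptions for illustration; the real coordinates 1618-1625 refer to the synthetic gene, which is not reproduced here. Assuming that nucleotide 1618 is the first base of a codon (1617 is divisible by three), the first two codons change from CTC CAG (Leu, Gln) to TGC TGC (Cys, Cys).

```python
def replace_segment(seq, first, last, old, new):
    """Replace seq[first..last] (1-based, inclusive) after verifying it equals old."""
    if seq[first - 1:last] != old:
        raise ValueError("sequence at the given coordinates does not match 'old'")
    if len(new) != len(old):
        raise ValueError("replacement must have the same length as the original")
    return seq[:first - 1] + new + seq[last:]


def first_two_codons(segment):
    """Return the first two complete codons of a segment."""
    return [segment[0:3], segment[3:6]]


# Only the three codons needed for this illustration.
MINI_CODE = {"CTC": "L", "CAG": "Q", "TGC": "C"}

# Hypothetical flanks (AAA ... TTT) around the 8-nucleotide segment from the text.
toy = "AAA" + "CTCCAGGC" + "TTT"
mutated = replace_segment(toy, 4, 11, "CTCCAGGC", "TGCTGCGG")

before = [MINI_CODE[c] for c in first_two_codons("CTCCAGGC")]  # Leu, Gln
after = [MINI_CODE[c] for c in first_two_codons("TGCTGCGG")]   # Cys, Cys
```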

The above mentioned decrease in immunodominant epitopes combined with the increase in
immunogenicity of the other epitopes is expected to greatly enhance the efficacy of the
nucleotide sequence construct as a vaccine.

During the production of the nucleotide sequence construct, it is convenient to ligate the
snuts into pieces. The pieces, as described above, are characterised by their reversible
assembly as there are no duplicate restriction enzyme sites. In a preferred embodiment one
piece (herein designated P3) contains S495ClaI, S650-720EcoRI, and P2. Another piece (herein
designated P8) contains S1265XhoI, S1465PstI, and P4gp160. Yet another piece (herein designated
P7) contains S1265XhoI, S1465PstI, P4gp160, S2060SacII and P5.

One advantage of the present nucleotide sequence construct is the easy access to
exchange and alterations in the content and function of the nucleotide sequence and the
encoded amino acid sequence. In one embodiment the nucleotide sequence coding for a
functional region or parts thereof of the amino acid sequence is repeated. The repeat could
be back-to-back or a functional region or parts thereof could be repeated somewhere else in
the sequence. Repeated could mean two (one repetition) but could also be three, six, or nine
repeats. In a much preferred embodiment the repeated nucleotide sequence codes for
amino acids in the third variable region.

In order to improve the protective capabilities of the invention against infections with HIV,
one embodiment of the invention relates to the combination of epitopes. The present
nucleotide sequence construct allows insertion of one or more new nucleotide sequences
isolated from another group and/or subtype of HIV and/or isolated from another patient.
Hereby a vaccine or antigen with two or more epitopes from two or more HIVs is obtained. In
a preferred embodiment, the V3 is replaced by the new nucleotide sequence. In a much
preferred embodiment, the new nucleotide sequence codes for amino acids in the third
variable region of a different HIV isolate.

In order to improve the efficacy of the vaccine, aiming at raising cellular immunity, a
nucleotide sequence coding for a T-helper cell epitope is included in the nucleotide
sequence construct. The nucleotide sequence coding for a T-helper cell epitope or a
T-helper cell epitope containing amino acid sequence can be placed anywhere in the nucleotide
sequence construct as long as it does not interfere with the function of the envelope molecule.
However, it is preferably placed in the tail of the nucleotide sequence construct or between
the leader sequence and the envelope gene. The T-helper epitopes are preferably selected
from core proteins such as P24gag or from a non-HIV pathogen such as a virus or bacterium, e.g.
BCG antigen 85. For a therapeutic vaccine an HIV helper epitope is preferred since the
patient is already primed by the HIV infection. For a prophylactic vaccine, a T-helper cell
epitope from a frequently occurring non-HIV pathogen such as Hepatitis B, BCG, CMV, or EBV
is preferred. Also, since the synthetic BX08 envelope genes may contain T-helper cell
epitopes in addition to important antibody epitopes, the synthetic BX08 vaccine genes can be
mixed with other DNA vaccines to improve the efficacy of the other DNA vaccine.

One aspect of the present invention relates to individualised immunotherapy, wherein the
virus from a newly diagnosed patient is directly cloned, the envelope or subunits
corresponding to snuts or pieces are produced with highly expressed codons, inserted into any
of the nucleotide sequence constructs described above and administered to the patient as a
vaccine. Hereby a therapeutic DNA vaccine is obtained that will help the patient to break
immune tolerance or induce/reinduce an appropriate immune response. In one embodiment
the variable regions of the virus are produced with highly expressed codons and exchanged
into any of the nucleotide sequence constructs described above.

In one embodiment of the invention, the nucleotide sequence construct as described above 
satisfies at least one of the following criteria: 

a) serum extracted from a Macaque primate which has been immunised by administration of
an expression vector containing the nucleotide sequence construct is capable of eliminating
SHIV as determined by quantitative PCR and/or virus culturing.

b) serum extracted from a primate which has been immunised by administration of an
expression vector containing the nucleotide sequence construct is capable of neutralising
HIV-1 BX08 and/or other HIV-1 strains in vitro.

c) serum, extracted from a mouse which has been immunised by administration of an
expression vector containing the nucleotide sequence construct four times in intervals of
three weeks and boosted after 15 weeks, is capable of decreasing the concentration of HIV
antigen in a culture of HIV, serum or PBMCs by at least 50%. An example of such a procedure
is shown in example 9.

In one embodiment of the invention, the nucleotide sequence construct of the invention is
used in medicine. That is, it is used as a vaccine, for the production of a recombinant protein,
such that the recombinant protein is used as a vaccine, or the nucleotide sequence construct
or the recombinant protein is used in a diagnostic composition.

Thus, the nucleotide sequence construct of the invention can be used for the manufacture of
a vaccine for the prophylaxis of infection with HIV in humans.

Intramuscular inoculation of nucleotide constructs, i.e. DNA plasmids encoding proteins, has
been shown to result in the generation of the encoded protein in situ in muscle cells and
dendritic cells. By using cDNA plasmids encoding viral proteins, both antibody and CTL
responses were generated, providing homologous and heterologous protection against
subsequent challenge with either the homologous or a cross-strain virus, respectively.
The standard techniques of molecular biology for preparing and purifying DNA constructs
enable the preparation of the DNA therapeutics of this invention. While standard techniques
of molecular biology are therefore sufficient for the production of the products of this
invention, the specific constructs disclosed herein provide novel therapeutics which can
produce cross-strain protection, a result heretofore unattainable with standard inactivated
whole virus or subunit protein vaccines.

The amount of expressible DNA to be introduced to a vaccine recipient will depend on the
strength of the transcription and translation promoters used in the DNA construct, and on the
immunogenicity of the expressed gene product. In general, an immunologically or
prophylactically effective dose of about 10 ng to 300 µg is administered directly into muscle
tissue. Subcutaneous injection, intradermal introduction, impression through the skin,
inoculation by gene gun, preferably with DNA-coated gold particles, and other modes of
administration such as intraperitoneal, intravenous, peroral, topical, vaginal, rectal, intranasal
or by inhalation delivery are also contemplated. It is also contemplated that booster
vaccinations are to be provided. It is further contemplated that booster vaccinations with
recombinant antigens are to be provided, administered as described above.

The DNA may be naked, that is, unassociated with any proteins, adjuvants or other agents
which impact on the recipient's immune system. In this case, it is desirable for the DNA to be
in a physiologically acceptable solution, such as, but not limited to, sterile saline or sterile
buffered saline. Alternatively, the DNA may be associated with surfactants, liposomes, such
as lecithin liposomes or other liposomes, such as ISCOMs, known in the art, as a
DNA-liposome mixture (see for example WO93/24640), or the DNA may be associated with an
adjuvant known in the art to boost immune responses, such as a protein or other carrier.
Agents which assist in the cellular uptake of DNA, such as, but not limited to, calcium ions,
detergents, viral proteins and other transfection facilitating agents may also be used to
advantage. These agents are generally referred to as transfection facilitating agents and as
pharmaceutically acceptable carriers.

Those skilled in the field of molecular biology will understand that any of a wide variety of
expression systems may be used. A wide range of suitable mammalian cells are available
from a wide range of sources (e.g. the American Type Culture Collection, Rockland, Md.;
also, see e.g. Ausubel et al. 1992). The method of transformation or transfection and the
choice of expression vehicle will depend on the host system selected. Transformation and
transfection methods are described e.g. in Ausubel et al. 1992; expression vehicles may be
chosen from those provided e.g. in P.H. Pouwels et al. 1985.

In one embodiment of the present invention the protein encoded by the nucleotide sequence
construct is produced by introduction into a suitable mammalian cell to create a stably
transfected mammalian cell line capable of producing the recombinant protein. A number of
vectors suitable for stable transfection of mammalian cells are available to the public, e.g. in
Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al. 1985); methods for constructing
such cell lines are also publicly available, e.g. in Ausubel et al. 1992.

Standard reference works describing the general principles of recombinant DNA technology
include Watson, J.D. et al 1987; Darnell, J.E. et al 1986; Old, R.W. et al 1981; Maniatis, T. et
al 1989; and Ausubel et al 1992.

Figure legends

The invention is further illustrated in the following non-limiting examples and the drawing 
wherein 

Figure 1 provides the codon preference of highly expressed proteins in human cells.

Figure 2 illustrates the outline of gp120, gp140, gp150, and gp160 encoding synthetic genes
derived from the wild type sequence at the top. Variable (V) and constant (C) regions are
shown together with the leader peptide (LP) and the transmembrane spanning domain
(TMD). The approximate nucleotide positions of the restriction enzyme sites are shown.
The approximate positions of the three restriction enzyme sites dividing the full-length
gp160 gene into the three pieces, each containing only unique restriction enzyme sites, are
shown in bold.

Figure 3 illustrates the building of the synthetic gp120 gene. Variable (V) and constant (C)
regions are shown together with the leader peptide (LP) and the transmembrane spanning domain
(TMD). The approximate nucleotide positions of the restriction enzyme sites are shown.

Figure 4 illustrates the building of the synthetic gp140 gene. Variable (V) and constant (C)
regions are shown together with the leader peptide (LP) and the transmembrane spanning domain
(TMD). The approximate nucleotide positions of the restriction enzyme sites are shown.

Figure 5 illustrates the building of the synthetic gp150 gene. Variable (V) and constant (C)
regions are shown together with the leader peptide (LP) and the transmembrane spanning domain
(TMD). The approximate nucleotide positions of the restriction enzyme sites are shown.

Figure 6 illustrates the building of the synthetic gp160 gene. Variable (V) and constant (C)
regions are shown together with the leader peptide (LP) and the transmembrane spanning domain
(TMD). The approximate nucleotide positions of the restriction enzyme sites are shown.

Figure 7 illustrates the codons coding amino acids in general.

Figure 8 illustrates how overlapping PCR is performed.

Figure 9 illustrates how PCR using conserved flanking ends is performed.

Figure 10 illustrates how S1265XhoI is produced using complementary strands (minigene
approach) technology. The S1265XhoI is ligated from three sets of complementary strands
into the vector pBluescript KS+ between restriction enzyme sites XhoI and PstI.

Figure 11 illustrates how S1465PstI is produced. The same approach as used for
the production of S1265XhoI was used, except that only two sets of complementary strands
were used.

Figure 12 illustrates the assembly of P1. The S0-N-Lang and S235EcoRV are ligated into the XbaI
and PstI site of the S375PstI containing plasmid.

Figure 13 illustrates the assembly of P2. The S900XbaI was excised by HindIII and SacI from
its plasmid and ligated with S990SacI (SacI-SpeI) into the S1110SpeI plasmid that was opened
at the HindIII and SpeI sites.

Figure 14 illustrates the assembly of P3. S495ClaI (ClaI-EcoRI) and S650-720EcoRI (EcoRI-XbaI)
and P2 (XbaI-XhoI) were ligated simultaneously into the P1 plasmid opened at the ClaI
and XhoI sites to obtain the P3 plasmid.

Figure 15 illustrates the assembly of P4gp160. S1890HindIII (SacI-HindIII) and S1700EagI
(HindIII-EagI) were ligated simultaneously into the S1630XbaI plasmid opened by SacII and EagI.

Figure 16 illustrates the assembly of P5. S2190ClaI (ClaI-PstI) and S2330PstI (PstI-EcoRI) were
ligated into the S2425Es plasmid opened by ClaI and EcoRI.

Figure 17 illustrates the assembly of P8gp160. S1465PstI (XbaI-PstI) and S1265XhoI (PstI-XhoI) were
ligated into the P4gp160 plasmid opened by XbaI and XhoI.

Figure 18 illustrates the assembly of P8gp150. S1465PstI (XbaI-PstI) and S1265XhoI (PstI-XhoI) were
ligated into the plasmid containing P4gp150 with the stop codon. The P4gp150 plasmid was opened
at the XbaI and XhoI sites for the ligation.

Figure 19 illustrates the assembly of P8gp140. S1465PstI (XbaI-PstI) and S1265XhoI (PstI-XhoI) were
ligated into the plasmid containing P4gp140 with a stop codon. The P4gp140 plasmid was opened
at the XbaI and XhoI sites for the ligation.

Figure 20 illustrates the assembly of P8gp41. Two complementary nucleotide strands
1265gp41S and 1265gp41AS designed with overhangs creating a 5' XhoI and a 3' PstI
restriction enzyme site were annealed and ligated into the piece 8, which is already opened
at the XhoI and PstI sites, whereby S1265 is deleted.

Figure 21 illustrates the assembly of P7. P8 (XhoI-SacII) and S2060SacII (SacII-ClaI) were
ligated into the P5 plasmid opened at XhoI and ClaI.

Figure 22a shows an SDS-PAGE of 35S-labelled HIV-1 BX08 envelope glycoproteins
radio-immunoprecipitated from transiently transfected 293 cells using the indicated plasmids.
Cell pellet (membrane bound antigens) or cell supernatant (secreted antigens) were
precipitated by a polyclonal anti-HIV-1 antibody pool. Lane 1: untransfected cells. Lane 2:
supernatant from syn.gp120MN transfected cells. Lane 3: cell pellet from wt.gp160BX08
transfected cells. Lane 4: cell pellet from cells co-transfected by wt.gp160BX08 and pRev.
Lane 5: Mwt. marker. Lane 6: cell pellet from syn.gp160BX08 transfected 293 cells. Lane 7:
cell pellet from syn.gp150BX08 transfected 293 cells. Lane 8: supernatant from syn.gp140BX08
transfected cells. Lane 9: supernatant from syn.gp120BX08 transfected cells.

Figure 22b is an SDS-PAGE of 35S-labelled HIV-1 BX08 envelope glycoproteins
radio-immunoprecipitated from transiently transfected 293 cells as cell pellet (membrane
bound) or cell supernatant (secreted antigens) by anti-HIV-1 antibody pool using the
indicated plasmids. Lane 1: untransfected 293 cells. Lane 2: cell pellet from syn.gp160MN
transfected 293 cells as positive control (Vinner et al 1999). Lane 3: cell supernatant from
syn.gp120MN transfected 293 cells as positive control (Vinner et al 1999). Lane 4: cell
supernatant from syn.gp120BX08 transfected 293 cells demonstrating a glycoprotein
band of 120 kDa. Lane 5: cell supernatant from syn.gp140BX08 transfected 293 cells
demonstrating a glycoprotein band of 120 kDa. Lane 6: Mwt. marker. Lane 7, at two
different exposure times: cell pellet from syn.gp150BX08 transfected 293 cells
demonstrating a glycoprotein band of 120 kDa (lower gp30 band is not well seen in this
exposure). Lane 8: cell supernatant from syn.gp150BX08 transfected 293 cells showing
no secreted proteins (all protein is membrane bound, see lane 7).

Figure 22c shows fluorescence microscopy of U87.CD4.CCR5 cells transfected with BX08
gp160 genes plus pGFP. Panel A: cells transfected with empty WRG7079 vector plus
pGFP showing no syncytia. Panel B: cells transfected with wild type BX08gp160 gene
plus pGFP showing some syncytia. Panel C: cells transfected with synBX08gp160 plus
pGFP showing an extreme degree of syncytia formation. This demonstrates expression,
functionality, and tropism of the expressed BX08 glycoprotein, with much more expressed
functionally active gp160 from the synthetic BX08 gene.

Figure 23 shows the anti-Env-V3 BX08 antibody titers (IgG1). Panels show individual mice
DNA immunized with syn.gp140BX08 plasmid either i.m. (left panel) or by gene gun (right
panel), respectively. Immunization time points are indicated by arrows.



Figure 24 shows a Western blot of (from left to right) one control strip, followed by sera
(1:50) from 2 mice i.m. immunized with synBX08gp120, 2 mice i.m. immunized with
synBX08gp140, 2 mice i.m. immunized with synBX08gp150, and 2 mice immunized with
synBX08gp160, followed by 2 mice gene gun immunized with synBX08gp120, 2 mice
gene gun immunized with synBX08gp140, 2 mice gene gun immunized with
synBX08gp150, and 2 mice gene gun immunized with synBX08gp160, respectively. Strip 5 is
from mouse 5.1, DNA immunized i.m. with synBX08gp140 plasmid (same mouse as in figure
23). Plasma was examined at week 18. The position of gp160 (spiked with four coupled
gp51), gp120 and gp41 is indicated at the right. A positive reaction to HIV glycoproteins
further demonstrates the mouse anti-HIV immunoglobulin reacting to HIV of a strain (IIIB)
different from BX08, illustrating cross-strain reactivity.

Figure 25 Theoretical example of calculation of the 50% inhibitory concentration (IC50)
values. IC50 for each mouse serum is determined by interpolation from the plots of percent
inhibition versus the dilution of serum.
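
The interpolation can be sketched in Python as follows. This is only one possible reading: the text does not say whether the interpolation is done on a linear or logarithmic dilution axis, so the sketch assumes linear interpolation of percent inhibition against the log10 of the reciprocal dilution. The data points and the function name are hypothetical.

```python
import math


def ic50_reciprocal_dilution(dilutions, inhibitions, target=50.0):
    """Interpolate the reciprocal serum dilution giving `target` percent inhibition.

    dilutions   : reciprocal dilutions in increasing order (e.g. 20 means 1:20)
    inhibitions : percent inhibition measured at each dilution
    Returns None if the curve never crosses the target.
    """
    points = list(zip(dilutions, inhibitions))
    for (d1, y1), (d2, y2) in zip(points, points[1:]):
        # Look for the pair of neighbouring points that brackets the target.
        if y1 != y2 and (y1 - target) * (y2 - target) <= 0:
            frac = (y1 - target) / (y1 - y2)
            log_d = math.log10(d1) + frac * (math.log10(d2) - math.log10(d1))
            return 10 ** log_d
    return None


# Hypothetical serum titration: 1:20, 1:80, 1:320, 1:1280.
example = ic50_reciprocal_dilution([20, 80, 320, 1280], [90.0, 70.0, 30.0, 10.0])
```

With these invented values the 50% point lies halfway (on the log scale) between the 1:80 and 1:320 dilutions.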

Figure 26 CTL responses were measured at week 18 to the mouse H-2Dd restricted BX08
V3 CTL epitope (IGPGGRAFYTT) for BALB/c mice (H-2Dd) i.m. immunized at week 0, 9,
and 15 with the synthetic vaccine genes syn.gp120BX08, syn.gp140BX08, syn.gp150BX08,
and syn.gp160BX08, respectively, and median values of different E:T ratios for groups of
mice are shown (26A). Intramuscular DNA immunization with syn.gp150BX08 induced a
higher CTL response when injected i.m. in high amounts versus gene gun inoculation of
skin (26B).

Figure 27 Summary of western immunoblotting assay of mice sera (1:40) collected at week
0, 9, and 18 from mice genetically immunized with syn.gp120BX08, syn.gp140BX08,
syn.gp150BX08, syn.gp160BX08, wt.gp160BX08, and wt.gp160BX08 plus pRev, respectively.
Percent responders in groups of 17-25 mice against gp120 and gp41 are shown.

Figure 28 IgG anti-rgp120 (IIIB) antibody titers of individual mice inoculated at week 0, 9, 15
(28A), or gene gun immunized at week 3, 6, 9, and 15 (28B) with the syn.gp150BX08 DNA
vaccine.



Figure 29 IgG antibody titers to HIV-1 rgp120IIIB. Median titers are shown from groups of
mice i.m. inoculated at week 0, 9, and 15 (29A), or gene gun immunized at week 0, 3, 6,
9, and 15 (29B) with the synthetic genes syn.gp120BX08, syn.gp140BX08, syn.gp150BX08,
and syn.gp160BX08, respectively.

Examples

Example 1: Designing the nucleotide sequence construct

Initially the overall layout of the nucleotide sequence construct is decided. The overall layout
comprises the various derivatives the gene will be expressed as. For BX08 these include, but
are not restricted to, gp160, gp150, gp140, gp120, and gp41.

Next, the vehicle of expression (plasmid or virus) is to be determined: Preparation for a
suitable vector determines the need for a leader sequence, the terminal restriction enzyme sites
and whether or not an N- or C-terminal protein tag is to be considered (poly-His, Myc
antibody epitope, etc.). For BX08 a plasmid expression vehicle was chosen. All native wild
type HIV codons are systematically exchanged with the codons most frequently represented
in a pool of highly expressed human genes (figure 1). By this exchange the amino acid
sequence is conserved while the nucleotide sequence is dramatically altered. Thus, gene
structures like overlapping reading frames (e.g. vpu, rev, and tat) or secondary structures
(e.g. RRE) are most likely destroyed whereas protein cleavage sites and glycosylation sites
are maintained. The 100% amino acid identity between wtBX08 and synthetic BX08 in the
present examples should be calculated after the initial Ala-Ser amino acid sequence, as that
sequence is encoded by the 6 nucleotide long NheI restriction enzyme site.
Depending on the restriction enzyme sites located in the expression vector it is decided 
20 which restriction enzyme sites can be present (tolerated) throughout the finished gene 
construct. The terminal restriction enzyme sites of the synthetic gene must remain unique to 
enable cloning into the vector chosen for expression. General requirements for restriction 
enzyme sites of choice: Preferably creating cohesive ends facilitating ligation, creating no 
compatible ends with adjacent restriction enzyme sites (e.g. BamHVBgM), and being efficient 
25 cutters. For BX08 the restriction enzyme sites accepted were the ones present in the 
polylinker of the pBluescript cloning vectors (Eagl, Mlu\, EcoRV, Psrl, C/al, EcoRI, Xoal, 
Sad, Spel, X/joI, HindW, SacU, Not\, BamH\, Sma\, Sa/I, Oral, Kpn\ with the exception of 
Sg/ll and /Mel). This was decided to satisfy the original cloning strategy using individual 
cloning of snuts in pBluescript with restriction enzyme cleaved (trimmed) ends after PCR 
30 amplification, which is not necessary when blunt-end cloning and assembling of 

complementary oligonucleotides are employed. All locations at which the selected restriction 
enzyme sites can be introduced by silent mutations (keeping 100% loyal to the amino acid 
sequence) are identified using the SILMUT software or equivalent. 
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From these possible restriction enzyme sites, a selection of restriction enzyme sites are 
introduced by silent nucleotide substitutions around functional regions of choice of the 
corresponding gene (e.g. RRE) or gene products (e.g. variable region 1 (V1), V2, V3, CD4 
binding area, transmembrane domain, and regions of immunological significans, etc.). 
5 Restriction enzyme sites are located at terminal positions of subcloned snuts (building 
entities) but additional restriction enzyme sites may be present within subunits. For BX08 the 
construct was initially to be cloned in the WRG7079 vector containing a tPA-leader 
sequence. Cloning sites were S'-Nhe\ -► BamHI-3\ The entire humanised BX08 sequence 
was divided into thirds: 5'-Nhe\ -+ Xho\ Sacll Ba/nHI-3'. These sites were chosen in 
10 this particular order because it resembles the polylinker of pBluescript (KS ) enabling 
successive ligations of the assembled thirds in this cloning vector. Within these thirds 
restriction enzyme sites were kept unique. Next, restriction enzyme sites were placed to flank 
the functional regions chosen as follows: 

A. (5'-V1): EcoRV 235: Between C1 and V1. Alternatives: 3xHindIII (already excluded
because of exclusive use at position 1890) or EcoRV.

B. (V1-3'): Only alternative PstI 375.

C. (5'-V2): as B.

D. (V2-3'): Alternatives: SpeI, ClaI 495. ClaI chosen because it is closer to V2.

E. EcoRI 650 placed because next possible site was too far away.

F. (5'-V3): BglII 720 was the alternative closest to the V3 region and furthermore unique.

G. (V3-3'): Alternatives XhoI (excluded) and XbaI 900 located very close to the V3 loop.

H. (5'-V4): SacI 990: alternatively EcoRI or BamHI (both excluded).

I. (V4-3'): Alternatives SpeI 1110, KpnI 1145, PstI 1135. PstI already used, SpeI chosen
because of distance to previous site (SacI 990).

J. (5'-V5): XhoI initially determined.

K. (Fusion peptide-3'): PstI 1465 was the closest alternative to XhoI 1265.

L. (5'-Immunodominant region): XbaI 1630 chosen among EcoRV (blunt end), PstI and XhoI
(both already used).

M. (Immunodominant region-3'): EagI 1700 perfect location.

N. (C34 and C43-3' (Chan, Fass, et al. 1997), and 5'-transmembrane domain): SacII. No
alternatives.

O. (Transmembrane domain-3'): SacII 2060 already present.

P. ClaI 2190 perfect position in relation to previous RE-site.

Q. PstI perfect position in relation to previous RE-site.

R. EcoRI 2400 introduced to facilitate later substitution of terminal snut.

S. BamHI 2454 determined by the WRG7079 vector.

Remove undesired restriction enzyme sites by nucleotide substitutions (keeping loyal to the
amino acid sequence). Nucleotide substitutions should preferably create codons frequently
used in highly expressed human genes (figure 1). If that is not possible, the codons should
be selected from the regular codons (figure 7). The substitutions made to the second
nucleotide sequence to obtain desired restriction enzyme sites are shown in Table 2.
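
As an illustration of site removal, the following Python sketch destroys one occurrence of an undesired site by a single synonymous codon change. It is a sketch under stated assumptions, not the procedure actually used: the sequence is assumed to be in frame, the `preferred` argument is a hypothetical stand-in for the codon preferences of figures 1 and 7 (which are not reproduced here), and the test fragment is invented.

```python
# Standard genetic code, indexed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate an in-frame coding sequence (length must be a multiple of 3)."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def count_sites(seq, site):
    """Count occurrences of `site` in `seq`, including overlapping ones."""
    return sum(seq.startswith(site, i) for i in range(len(seq)))


def remove_site(coding_seq, site, preferred=None):
    """Destroy the first occurrence of `site` with one synonymous codon swap.

    `preferred` optionally maps an amino acid letter to codons in order of
    preference (hypothetical; e.g. codons of highly expressed human genes).
    Returns the new sequence, the input if the site is absent, or None if no
    single synonymous swap reduces the number of sites.
    """
    pos = coding_seq.find(site)
    if pos < 0:
        return coding_seq
    before = count_sites(coding_seq, site)
    first_codon = pos // 3
    last_codon = (pos + len(site) - 1) // 3
    for c in range(first_codon, last_codon + 1):
        codon = coding_seq[3 * c:3 * c + 3]
        aa = CODON_TABLE[codon]
        synonyms = sorted(k for k, v in CODON_TABLE.items() if v == aa)
        for alt in (preferred or {}).get(aa, synonyms):
            if alt == codon or CODON_TABLE.get(alt) != aa:
                continue  # skip the current codon and any non-synonymous entry
            mutated = coding_seq[:3 * c] + alt + coding_seq[3 * c + 3:]
            if count_sites(mutated, site) < before:
                return mutated
    return None


# Hypothetical fragment (Ala-Leu-Gln-Gly) containing a PstI site, CTGCAG.
fragment = "GCCCTGCAGGGC"
print(remove_site(fragment, "CTGCAG", preferred={"L": ["CTG", "CTC", "CTT"]}))
```

With the hypothetical preference list above, the leucine codon CTG is replaced by CTC, a third-position g -> c change of the same kind as several PstI eliminations listed in Table 2.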



Table 2 lists silent nucleotide substitutions in the humanised BX08 envelope
sequence. Substitutions were made to create or delete restriction enzyme sites.

Position:  substitution  Remarks:
138        c -> g        creates Mlu I site on pos. 134-139
240        c -> t        creates EcoRV site on pos. 238-243
501        c -> a        creates Cla I site on pos. 501-506
502        a -> t        do
503        g -> c        do
504        c -> g        do
657        c -> a        creates EcoRI site on pos. 656-661
660        c -> t        do
724        c -> a        creates Bgl II site on pos. 724-729
726        c -> g        do
727        a -> t        do
728        g -> c        do
729        c -> t        do
840        c -> t        EagI site is eliminated
904        a -> t        creates Xba I site on pos. 904-909
905        g -> c        do
906        c -> t        do
907        c -> a        do
909        c -> a        do
994        a -> t        creates Sac I site on pos. 990-995
995        g -> c        do
1116       c -> t        creates SpeI site on pos. 1114-1119
1119       c -> t        do
1273       a -> t        creates XhoI site on pos. 1272-1277
1274       g -> c        do
1275       c -> g        do
1293       c -> t        Bgl II site is eliminated
1443       c -> ?        BstXI site is eliminated
1452       g -> c        do
1467       c -> t        PstI site on pos. 1466-1471
1470       c -> a        do
1590       g -> c        PstI site on pos. 1588-1593 is eliminated
1620       g -> c        PstI site on pos. 1618-1623 is eliminated
1638       c -> t        creates XbaI site on pos. 1638-1643
1641       g -> a        do
1653       g -> c        PstI site is eliminated
1687       a -> t        PstI site is eliminated
1688       g -> c        PstI site is eliminated
1710       c -> g        creates EagI site on pos. 1709-1714
1758       c -> t        Bgl II site is eliminated
1875       g -> c        PstI site is eliminated
1893       c -> a        Hind III on pos. 1893-1898
1897       c -> t        do
1944       c -> t        Bgl II site is eliminated






Table 2 (continued); entries are listed by column, and the row alignment between the
columns is not preserved.

Position:      2199, 2202, 2203, 2253, 2292, 2320, 2321, 2322, 2325, 2430, 2433

substitution:  c -> a, c -> a, c -> t, c -> t, c -> t, c -> t, c -> g, g -> c, a -> t, a -> t,
               g -> t

Remarks:       creates EcoRI site on pos. 2429-2434; do; do; do; do; do; do;
               SacII site is eliminated; PstI site is eliminated;
               PstI site on pos. 2321-2326 is eliminated; Cla I site on pos. 2198-2203



Example 2: synthesis of oligos

In order to clone the individual snuts, nucleotide strands were synthesised or purchased. In
total 28 synthetic nucleotide strands were synthesised. Nucleotide strands were synthesised
by standard 0.2 µmol β-cyanoethyl-phosphoramidite chemistry on an Applied Biosystems
DNA synthesiser model 392, employing 2000 Å CPG columns (Cruachem, Glasgow,
Scotland), acetonitrile containing less than 0.001 % water (Labscan, Dublin, Ireland) and
standard DNA-synthesis chemicals from Cruachem, including phosphoramidites at 0.1 M
and tetrahydrofuran/N-methylimidazole as cap B solution. The nucleotide strands O-N-C
and 119MS-RC (for cloning of snut O-N-Lang), 650-E-BG and 720-XBAC-31 (for cloning of
snut 650-720-EcoRI), 2425esup and 2425ESdo (for cloning of snut 2425-E-S) were
synthesised with 5' end "trityl on" and purified on "Oligonucleotide Purification Cartridges"
(Perkin Elmer, CA, USA) as described by the manufacturer. Other nucleotide strands
(235-EC05, 375-pst1.seq, 495-Cla1.seq, 900-Xba1, 990-sac1, 1110-SPE, 1630-Xba.seq,
1700-Eag.seq, 17-Eag.seq, 1890-Hind.MPD, 2060-sac, 2190-cla, 2330-pst) were synthesised with
5' end "trityl off" and purified by standard ethanol precipitation. Oligos 1265-1UP,
1265-1DO, 1265-2UP, 1265-2DO, 1265-3UP, 1265-3DO, 1465-1UP, 1465-1DO, 1465-2UP and
1465-2DO were purchased from Pharmacia.

Example 3: Cloning of snuts
The nucleotide sequence construct was designed in 17 small DNA pieces called snuts
(Table 3) encompassing important structures like variable and constant regions, each flanked
by restriction enzyme (RE) sites to facilitate cassette exchange within each third of the
gene: NheI-XhoI, XhoI-SacII, SacII-BamHI.

Each snut was cloned individually in a commercial vector (pBluescriptKS or pMOSblue) and
kept as an individual DNA plasmid, named after the snut, whose name gives the nucleotide
position of the RE site in BX08.






Table 3 lists the snuts by their name and cloning vector.

Name             Cloning vector
SO-N-Lang        pMOSblue
S235EcoRV        pMOSblue
S375PstI         pBluescriptSK
S495ClaI         pMOSblue
S650-720EcoRI    pMOSblue
S900XbaI         pMOSblue
S990SacI         pMOSblue
S1110SpeI        pMOSblue
S1265XhoI        pBluescriptSK
S1465PstI        pBluescriptSK
S1630XbaI        pBluescriptSK
S1700EagI        pBluescriptSK
S1890HindIII     pBluescriptSK
S2060SacII       pMOSblue
S2190ClaI        pMOSblue
S2330PstI        pMOSblue
S2425ES          pBluescriptSK



Three principally different methods were used to obtain the dsDNA corresponding to each of
the 17 snuts needed to build the synthetic BX08 genes.

1) "Overlapping" PCR: is based on the use of two ssDNA template nucleotide strands
(forward and reverse) that complement each other at their 3'-ends (figure 8). During the first
PCR cycle, both templates anneal to each other at the 3'-ends, allowing the full-length
polymerisation of each complementary strand during the elongation step. The newly
polymerised dsDNA strands are then amplified during the following cycles using an adequate
forward and reverse primer set (figure 8).
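
The first-cycle extension of two 3'-overlapping templates can be sketched in Python as follows. This is only an in-silico illustration of the principle; the function names, the `min_overlap` parameter and the test sequence are hypothetical and do not correspond to the actual template strands.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def overlap_extension(forward, reverse, min_overlap=10):
    """Return the full-length top strand obtained when two ssDNA templates
    (both written 5'->3') anneal at their complementary 3' ends and are
    filled in by a polymerase."""
    reverse_on_top = revcomp(reverse)  # reverse template read on the top strand
    longest = min(len(forward), len(reverse_on_top))
    for n in range(longest, min_overlap - 1, -1):
        # The 3' end of `forward` must equal the start of `reverse_on_top`.
        if forward[-n:] == reverse_on_top[:n]:
            return forward + reverse_on_top[n:]
    raise ValueError("templates do not overlap at their 3' ends")


# Hypothetical 33-nt target split into two templates sharing a 12-nt overlap.
target = "ATGGCCGACATCCTGAAGGTGCACGAGGACTTC"
forward_template = target[:21]
reverse_template = revcomp(target[9:])
print(overlap_extension(forward_template, reverse_template) == target)
```

The product of this step corresponds to the dsDNA that the flanking primer pair then amplifies in the subsequent cycles.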

Snut O-N-LANG (SO-N-Lang): two ng of the forward template nucleotide strand O-N-C and 2
ng of the reverse template nucleotide strand 119MS-RC were mixed together with 50 pmoles
of the forward primer O-N-LANG-5 (5'-CTAGCTAGCGCGGCCGACCGCCT-3') and
50 pmoles of the reverse primer O-N-LANG-3 (5'-CTCGATATCCTCGTGCATCTGCTC-3') in
a 100 µl PCR reaction volume containing 0.2 mM dNTPs, 1x ExpandHF buffer with MgCl2
(1.5 mM) and 2.6 units of enzyme mix (Expand™ High Fidelity PCR System from Boehringer
Mannheim). The PCR was performed with the PE Amp 9600 thermocycler (Perkin Elmer)
using the following cycle conditions: initial denaturation at 94°C for 30 sec, followed by 30
cycles of 94°C for 15 sec, 65°C for 30 sec, 72°C for 45 sec, with a final elongation at 72°C
for 5 min., and cooling to 4°C.

Snut 650-720-EcoRI (S650-720EcoRI): PCR amplification was performed as described for snut
O-N-LANG. One µg of the forward ssDNA template oligonucleotide 650-E-BG and 1 µg of
the reverse ssDNA template oligonucleotide 720-XBAC were mixed with 40 pmoles of the
forward primer 650-E-5 (5'-CCGGAATTCGCCCCGTGGTGAGCA-3') and 40 pmoles of the
reverse primer 720-X-3 (5'-CTGCTCTAGAGATGTTGCAGTGGGCCT-3').

2) "Normal" PCR amplification: Eleven nucleotide strands (235-EC05, 375-pst1, 900-xba1,
990-sac1, 1110-SPE, 1630-XBA, 1700-EAG, 1890-HIN, 2060-sac, 2190-cla, and 2330-pst)
were designed with common 5' and 3' flanking sequences, which allowed PCR amplification
with the same primer set (forward primer: BX08-5 (5'-AGCGGATAACAATTTCACACAGGA-3')
and reverse primer: BX08-3 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3')) (Figure 9). The
495-Cla1 oligonucleotide was designed without a common flanking sequence and was
therefore amplified with a specific set of primers, 495-5N/495-3N (5'-GAATCGATCATCACCCAG-3'
and 5'-GACGAATTCCGTGGGTGCACT-3'). Each oligonucleotide was
resuspended in 1 ml of water and kept as a stock solution (approximately 0.2 mM). PCR
amplification was performed with the Expand™ High Fidelity PCR System from Boehringer
Mannheim (Cat. No. 1759078). Four concentrations of template nucleotide strand were
systematically used: undiluted stock solution, stock solution 10⁻¹, stock solution 10⁻², stock
solution 10⁻³. One to 5 µl of synthetic ssDNA template was amplified using the following
conditions: BX08-5 (0.5 µM), BX08-3 (0.5 µM), 4 dNTPs (0.2 mM), 1x ExpandHF buffer with
MgCl2 (1.5 mM) and 2.6 units of enzyme mix. The PCR was performed using the PE Amp
9600 thermocycler (Perkin Elmer) using the following cycle conditions: initial denaturation at
94°C for 15 sec, followed by 30 cycles of 94°C for 15 sec, 65°C for 30 sec and 72°C for 45
sec, with a final elongation at 72°C for 7 min., and cooling to 4°C.
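
The amount of template entering a reaction at each dilution follows directly from the stock concentration. The short Python sketch below assumes the approximate 0.2 mM stock stated above and a template volume given in microlitres; the function name is hypothetical.

```python
def template_pmol(stock_mM, dilution, volume_ul):
    """Picomoles of oligonucleotide in `volume_ul` of a diluted stock.

    1 mM equals 1 nmol/µl, i.e. 1000 pmol/µl.
    """
    return stock_mM * 1000.0 * dilution * volume_ul


# The four template concentrations used: undiluted, 10^-1, 10^-2 and 10^-3.
for dilution in (1.0, 1e-1, 1e-2, 1e-3):
    print(dilution, template_pmol(0.2, dilution, 1.0))
```

Under these assumptions, 1 µl of undiluted stock carries roughly 200 pmol of template, and each tenfold dilution reduces this proportionally.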

3) Minigene approach: This method was used to synthesise S1265XhoI, S1465PstI and S2425ES.
Snut 2425-E-S (S2425ES): 100 picomoles of each oligonucleotide, 2425ES-up (35-mer;
5'-AATTCGCCAGGGCTTCGAGCGCGCCCTGCTGTAAG-3') and 2425ES-do (35-mer;
5'-GATCCTTACAGCAGGGCGCGCTCGAAGCCCTGGCG-3'), were mixed together in a 100 µl
final volume of annealing buffer containing 25 mM NaCl, 10 mM Tris and 1 mM EDTA. After
denaturation at 94°C for 15 min., the mixed oligonucleotides were allowed to anneal at 65°C
for 15 min. The annealing temperature was then allowed to decrease slowly from 65°C to
room temperature (22°C) during overnight incubation. The resulting dsDNA
fragments harboured EcoRI and BamHI restriction site overhangs that allowed direct cloning
in the pBluescript KS(+) vector using standard cloning techniques (Maniatis 1996).
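
The overhangs produced by annealing 2425ES-up and 2425ES-do can be checked with a short Python sketch. The two sequences are those given above; the `anneal` helper is hypothetical and assumes that both oligonucleotides carry 5' overhangs.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def anneal(up, down):
    """Anneal two oligonucleotides (both written 5'->3') that each carry a
    5' overhang. Returns (5' overhang of `up`, duplex core, 5' overhang of
    `down`), with the core read on the `up` strand."""
    down_on_top = revcomp(down)
    for i in range(len(up)):
        core = up[i:]
        if down_on_top.startswith(core):
            return up[:i], core, down[:len(down) - len(core)]
    raise ValueError("oligonucleotides are not complementary")


UP = "AATTCGCCAGGGCTTCGAGCGCGCCCTGCTGTAAG"    # 2425ES-up
DOWN = "GATCCTTACAGCAGGGCGCGCTCGAAGCCCTGGCG"  # 2425ES-do

overhang_up, core, overhang_down = anneal(UP, DOWN)
print(overhang_up, len(core), overhang_down)
```

The 5'-AATT and 5'-GATC protruding ends are the ones left by EcoRI and BamHI digestion respectively, which is why the annealed minigene can be ligated directly into a vector cut with these two enzymes.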

Snut 1265-XhoI (S1265XhoI): This snut was built according to the strategy depicted in figure 10.
Three minigenes were constructed following the same method described for snut 2425-E-S.
These minigenes are named 1265-1, 1265-2 and 1265-3. The minigene 1265-1 results from
the annealing of the oligonucleotides 1265-1up (68-mer;
5'-TCGAGCAGCGGCAAGGAGATTTTCCGCCCCGGCGGCGGCGACATGCGCGACAACTGGCGCAGCGAGCT-3')
and 1265-1do (68-mer;
5'-GTACAGCTCGCTGCGCCAGTTGTCGCGCATGTCGCCGCCGCCGGGGCGGAAAATCTCCTTGCCGCTGC-3').
1265-2 results from the annealing of 1265-2up (61-mer;
5'-GTACAAGTACAAGGTGGTGAAGATCGAGCCCCTGGGCATCGCCCCCACCAAGGCCAAGCGC-3')
and 1265-2do (63-mer;
5'-CACGCGGCGCTTGGCCTTGGTGGGGGCGATGCCCAGGGGCTCGATCTTCACCACCTTGTACTT-3').
Finally, 1265-3 results from the annealing of 1265-3up (69-mer;
5'-CGCGTGGTGCAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCTATGTTCCTCGGCTTCCTGGGCGCTGCA-3')
and 1265-3do (59-mer;
5'-GCGCCCAGGAAGCCGAGGAACATAGCGCCGATGCCCACGGCGCGCTTCTCGCGCTGCAC-3').
Each minigene was designed to present single-strand overhangs at its 5' and 3' ends that
allow easy ligation and direct XhoI-PstI cloning into the pBlueScript KS+ vector.

Snut 1465-PstI (S1465PstI): Two minigenes were constructed following the same method
described for snut 2425-E-S. These minigenes are named 1465-1 and 1465-2. The minigene
1465-1 was obtained after annealing of 1465-1up (90-mer;
5'-GGCAGCACCATGGGCGCCGCCAGCCTGACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTGCAGCAGCAGAACAACCTGCTG-3')
and 1465-1do (98-mer;
5'-CGCGCAGCAGGTTGTTCTGCTGCTGCACGATGCCGCTCAGCAGCTGGCGGGCCTGCACGGTCAGGGTCAGGCTGGCGGCGCCCATGGTGCTGCCTGCA-3'),
whereas minigene 1465-2 results from the annealing of 1465-2up (78-mer;
5'-CGCGCCATCGAGGCCCAGCAGCACCTGCTCCAGCTGACCGTGTGGGGCATCAAGCAGCTCCAGGCCCGCGTGCTGGCT-3')
and 1465-2do (78-mer;
5'-CTAGAGCCAGCACGCGGGCCTGGAGCTGCTTGATGCCCCACACGGTCAGCTGGAGCAGGTGCTGCTGGGCCTCGATGG-3').
Each minigene was designed to present single-strand overhangs at its 5' and 3' ends
that allow easy ligation and direct PstI-XbaI cloning into the pBlueScript KS+ vector using
standard cloning techniques (Maniatis) (see figure 11).

Example 4: assembly of snuts to pieces.

The snut genes were then assembled into pieces (Table 4) so that unique restriction enzyme
sites or mutagenesis can be used within each of these. This strategy requires fewer
assembly steps for optimal use of the cassette system. The following piece clones were made
and kept individually for construction of the synBX08 gp160 gene (Figure 6):

Table 4 lists pieces by their name and their snut composition.

Piece name   snut composition                        vector
P1           SO-N-LANG-S235EcoRV-S375PstI            pBluescriptSK
P2           S900XbaI-S990SacI-S1110SpeI             pMOSblue
P3           P1-S495ClaI-S650-720EcoRI-P2            pBluescriptSK
P4gp160      S1630XbaI-S1700EagI-S1890HindIII        pBluescriptSK
P5           S2190ClaI-S2330PstI-S2425ES             pBluescriptKS
P7           P8gp160-S2060SacII-P5                   pBluescriptKS
P8gp160      S1265XhoI-S1465PstI-P4gp160             pBluescriptKS



Piece 1: The building strategy is shown in figure 12.

Preparation of the insert DNA: Five to 15 µg of each plasmid, O-N-LANG-cl7 and
235-EcoRV-cl5N, were double-digested by XbaI/EcoRV and PstI/EcoRV, respectively, according to
classical RE digestion procedure (Maniatis). The RE digestion products were agarose gel
purified according to classical methods (Maniatis). All RE digests were loaded on a 3% Nusieve
3:1 (FMC), TBE 0.5X agarose gel and submitted to electrophoresis (7 Volts/mm during 2-3
hours) until optimal fragment separation. The agarose bands containing the DNA fragments
that correspond to the snut sequence sizes (243-bp for O-N-LANG and 143-bp for
235-EcoRV) were excised from the gel. The DNA was extracted from agarose by centrifugation
for 20 min at 5000 g using a spin-X column (Costar cat#8160).

Preparation of the vector: The snut 375-PstI-cl1 was used as plasmid vector. Five µg were
digested with XbaI and PstI. Removal of the polylinker XbaI/PstI fragment was performed by
classical agarose gel purification, using a 0.9% Seakem-GTG agarose, TBE 0.5X gel. The
linearised plasmid DNA was extracted from the agarose by filtration through a spin-X column.
All purified DNA fragments were quantified by spectrophotometry.

Ligation: All three DNA fragments, O-N-LANG (XbaI/EcoRV), 235-EcoRV (PstI/EcoRV) and
375-PstI (XbaI/PstI), were ligated together by classical ligation procedure, using an
equimolar (vector:insert1:insert2) ratio of 1:1:1. Thus, 200 ng (0.1 pmole) of
XbaI/PstI-linearised 375-PstI-cl1 were mixed with 16 ng of O-N-LANG (XbaI/EcoRV) and 10 ng
of 235-EcoRV (PstI/EcoRV) in a final reaction volume of 20 µl of 1X ligation buffer
containing 10 U of T4 DNA ligase (Biolabs, cat#202S). The ligation was allowed to proceed
overnight at 16°C.

Transformation: Competent XL1-Blue bacteria (Stratagene cat#200130, transformation
efficiency > 5×10⁶ col/µg) were transformed by classical heat-shock procedure: 1/10th of
the pre-chilled ligation reaction was mixed with 50 µl of competent bacteria. The mixture was
allowed to stand on ice for 30 min. Bacteria were heat-shocked at 42°C for 45 sec. and then
left 2 min. on ice before being resuspended in 450 µl of SOC medium. Transformed bacteria
were incubated for 1 hour at 37°C under shaking (250 rpm) and plated on LB-ampicillin agar
plates. The recombinant clones were allowed to grow for 16 hours at 37°C.

Colony screening: 10 to 50 recombinant colonies were screened by direct PCR screening
according to the protocol described in the pMOSBlue blunt-ended cloning kit booklet
(RPN 5110, Amersham). Each colony was picked and resuspended in 50 µl of water. DNA was
freed by a boiling procedure (100°C, 5 min). Ten µl of bacterial lysate were mixed with 1 µl
of a 10 mM solution of premixed 4 dNTPs, 1 µl of M13reverse primer (5 pmoles/µl,
5'-CAGGAAACAGCTATGAC-3'), 1 µl of T7 primer (5 pmoles/µl, 5'-TAATACGACTCACTATAGGG-3'),
5 µl of 10x Expand HF buffer 2 (Boehringer Mannheim, cat#1759078) and 0.5 µl of Enzyme mix
(Boehringer Mannheim, 5 U/µl) in a final volume of 50 µl. DNA amplification was performed
with a thermo-cycler PE9600 (Perkin-Elmer) using the following cycling parameters: 94°C,
2 min; 35 cycles (94°C, 30 sec; 50°C, 15 sec; 72°C, 30 sec); 72°C, 5 min; 4°C hold. Five µl
of the PCR products were analysed after electrophoresis on a 0.9% SeakemGTG, 0.5xTBE
agarose gel.

Nucleotide sequence confirmation: ds-DNA was purified from minicultures of the selected
clones with the JETstar mini plasmid purification system (Genomed Inc.). Sequencing was
performed using M13reverse and T7 primers with the Big Dye™ Terminator Cycle Sequencing
Ready reaction kit (Perkin-Elmer, Norwalk, Connecticut, P/N4303152) and the ABI-377
automated DNA sequenator (Applied Biosystems, Perkin-Elmer, Norwalk, Connecticut). Data
were processed with the Sequence Navigator and Autoassembler software (Applied Biosystems,
Perkin-Elmer, Norwalk, Connecticut).
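
The fragment masses quoted for the equimolar 1:1:1 ligation of piece 1 can be reproduced with a small Python sketch. It assumes an average mass of about 650 g/mol per base pair of dsDNA, a common approximation that is not stated in the text; the function names are hypothetical.

```python
AVERAGE_BP_MASS = 650.0  # g/mol per base pair of dsDNA (approximation)


def mass_ng(length_bp, amount_pmol):
    """Mass in nanograms of `amount_pmol` of a dsDNA fragment of `length_bp`.

    pmol multiplied by g/mol gives picograms; dividing by 1000 gives nanograms.
    """
    return amount_pmol * length_bp * AVERAGE_BP_MASS / 1000.0


def implied_length_bp(mass_in_ng, amount_pmol):
    """Fragment length implied by a given mass and molar amount."""
    return mass_in_ng * 1000.0 / (amount_pmol * AVERAGE_BP_MASS)


print(mass_ng(243, 0.1))            # O-N-LANG insert at 0.1 pmole
print(mass_ng(143, 0.1))            # 235-EcoRV insert at 0.1 pmole
print(implied_length_bp(200, 0.1))  # vector size implied by 200 ng = 0.1 pmole
```

Under this approximation, 0.1 pmole of the 243-bp and 143-bp fragments corresponds to about 16 ng and 9 ng, in line with the 16 ng and 10 ng used, and 200 ng of vector at 0.1 pmole implies a plasmid of roughly 3 kb.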

Piece 2: The strategy for building that piece is depicted in figure 13. RE digestion, DNA
fragment purification, ligation as well as direct PCR screening of recombinant colonies were
performed according to the same procedures described above for piece 1, except for the following:

- The linearised plasmid 1110-SpeI-cl24M1 was used as vector after being digested by
HindIII and SpeI, and agarose gel purified.

- A 166-bp HindIII/SacI fragment, obtained from snut 900-XbaI-cl15, as well as a 130-bp
SacI/SpeI fragment, obtained from snut 990-SacI-cl14, were agarose gel purified.

- Equimolar amounts (0.1 pmoles) of the three DNA fragments described above were ligated
in a one-step ligation.

- 100 µl of competent SCS110 bacteria (Stratagene cat# 200247) were transformed with
1/10th of the ligation products according to the manufacturer's instructions.

- Direct colony PCR screening was performed using T7 primer and pMOS-R
(5'-GTTGTAAAACGACGGCCAG-3').

Piece 3: The strategy for building that piece is depicted in figure 14. RE digestion, DNA
fragment purification, ligation as well as direct PCR screening of recombinant colonies were
performed according to the same procedures described above for piece 1, except for the following:

- The plasmid piece1-cl33 was linearised by ClaI and XhoI, in order to be used as vector,
and agarose gel purified.

- A 161-bp ClaI/EcoRI fragment, obtained from 495-ClaI-cl135M1, as well as a 254-bp
EcoRI/XbaI fragment, obtained from 650-720-EcoRI-cl39, and a 374-bp XbaI/XhoI fragment,
obtained from piece2-cl4, were agarose gel purified.

- Equimolar amounts (0.1 pmole) of each of these 4 DNA fragments were mixed and ligated
together.

- 50 µl of competent XL1-Blue bacteria were transformed with 1/10th of the ligation products
according to the protocol described for piece 1.

- Direct colony PCR screening was performed using M13Reverse and T7 primers.

Piece 4 gp160: The strategy for building that piece is depicted in figure 15. RE digestion,
DNA fragment purification, ligation as well as direct PCR screening of recombinant colonies
were performed according to the same procedures described above for piece 1, except for the
following:

- The plasmid 1630-XbaI-cl2 was linearised by SacII/EagI digestion and agarose gel
purified, in order to be used as vector.

- A 190-bp EagI/HindIII fragment, obtained from snut 1700-EagI-cl4, as well as a 177-bp
SacII/HindIII fragment, obtained from snut 1890-HindIII-cl8, were agarose gel purified.

- Equimolar amounts (0.1 pmoles) of the three DNA fragments described above were ligated
in a one-step ligation.

- 50 µl of competent XL1-Blue bacteria were transformed with 1/10th of the ligation products
according to the protocol described for piece 1.

- Direct colony PCR screening was performed using M13Reverse and T7 primers.

Piece 4-gp150: PCR-based site-directed mutagenesis was performed on double-stranded
plasmid DNA from piece4-cl4 according to an adaptation of the ExSite™ PCR-Based
Site-Directed Mutagenesis Kit procedure (Stratagene cat#200502) (Weiner, M.P., Costa, G.L.,
Schoettlin, W., Cline, J., Marthur, E., and Bauer, J.C. (1994) Gene 151:119-123). The
mutations introduced are shown in bold letters in the primer sequences below. PCR
amplification was performed with the Expand™ High Fidelity PCR System (Boehringer
Mannheim, cat#1759078). Briefly, 1.5 µg, 0.5 µg or 0.1 µg of circular dsDNA was mixed with
1.5 pmoles of P4M2S (5'-TCTGGAAGCTCAGGGGGCTGCATCCCTGGC-3') and 1.5
pmoles of P4M2AS (5'-CCCGCCTGCCCGTGTGACGGATCCAGCTCC-3') in a final volume
of 50 µl containing 4 dNTPs (250 µM each), 1x Expand HF buffer 2 (Boehringer Mannheim,
cat#1759078) and 0.75 µl of Enzyme mix (Boehringer Mannheim, 5 U/µl). The PCR was
performed with a PE9600 thermo-cycler (Perkin-Elmer Corporation) under the following
cycling parameters: 94°C, 2 min; 15 cycles (94°C, 45 sec; 68°C, 4 min); 72°C, 7 min and
4°C, hold. PCR products were phenol:chloroform extracted and precipitated (Maniatis).
Plasmid template was removed from PCR products by DpnI treatment (Biolabs) (Nelson, M.
and McClelland, M., 1992) followed by ethanol precipitation. Amplicons were resuspended in
50 µl sterile water and phosphorylated according to the following procedure: 7.5 µl of amplicons
were mixed with 0.5 µl of 100 mM DTT, 1 µl of 10x pk buffer and 1 µl of pk mix enzyme
(pMOSBlue blunt-ended cloning kit, Amersham, cat#RPN 5110). DNA kinasing was allowed
to proceed for 5 min at 22°C. After heat-inactivation (10 min, 75°C) of the pk enzyme, 1 µl of
ligase (4 units, Amersham, cat#RPN 5110) was directly added to the pk reaction. The ligation
was allowed to proceed overnight at 22°C. 50 µl of competent XL1-Blue bacteria were
transformed with 1/10th of the ligation reaction according to the classical protocol
(Maniatis). Insertion of mutations was checked by sequencing.

Piece 4-gp140: PCR-based site-directed mutagenesis was performed on piece4-cl4,
according to the procedure described for piece4-gp150 except that the primers P4M1AS
(5'-TGTGTGACTGATTGAGGATCCCCAACTGGC-3') and P4S
(5'-AGCTTGCCCACTTGTCCAGCTGGAGCAGGT-3') were used.

Snut 1265-XhoI-gp120: PCR-based site-directed mutagenesis was performed on plasmid
1265-XhoI-cl2M1 according to the procedure described for piece4-gp150 except that the
primers 1265MAS (5'-CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3') and 1265M2S
(5'-CGCGCCTAGGGCATCGGCGCTATGTTCCTC-3') were used.

Snut 1265-XhoI-gp160/uncleaved: PCR-based site-directed mutagenesis was performed
on plasmid 1265-XhoI-cl2M1 according to the procedure described for piece4-gp150 except
that the primers 1265MAS (5'-CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3') and
1265M2S (5'-AGCGCCGTGGGCATCGGCGCTATGTTCCTC-3') were used.

Snut 1465-PstI-CCG: PCR-based site-directed mutagenesis was performed on plasmid
1465-PstI-cl25 according to the procedure described for piece4-gp150 except that the
primers 1465MAS (5'-CTGCTTGATGCCCCACACGGTCAGCTG-3') and 1465MS
(5'-TGCTGCGGCCGCGTGCTGGCTCTAGA-3') were used.

Piece 5: The strategy for building that piece is depicted in figure 16. RE digestion, DNA
fragment purification, ligation as well as direct PCR screening of recombinant colonies were
performed according to the same procedures described above for piece 1, except for the following:

- The plasmid 2425-ES-cl2 was linearised by ClaI/EcoRI digestion and agarose gel purified,
in order to be used as vector.

- A 129-bp PstI/ClaI fragment, obtained from 2190-ClaI-cl6M15, as well as a 114-bp PstI/EcoRI
fragment, obtained from 2330-PstI-cl8, were agarose gel purified.

- Equimolar amounts (0.1 pmoles) of the three DNA fragments described above were ligated
in a one-step ligation.

- 50 µl of competent XL1-Blue bacteria were transformed with 1/10th of the ligation products
according to the protocol described for piece 1.

- Direct colony PCR screening was performed using T3 (5'-ATTAACCCTCACTAAAG-3') and
T7 primers.

Piece 8: The strategy for building that piece is depicted in figure 17. RE digestion, DNA
fragment purification, ligation as well as direct PCR screening of recombinant colonies were
performed according to the same procedures described above for piece 1, except for the following:

- The plasmid piece4-cl4 was linearised by XbaI/XhoI and agarose gel purified, in order to be
used as vector.

- A 200-bp XhoI/PstI fragment, obtained from 1265-XhoI-cl2M1, as well as a 178-bp PstI/XbaI
fragment, obtained from 1465-PstI-cl25, were agarose gel purified.

- Equimolar amounts (0.1 pmole) of these 3 DNA fragments were mixed and ligated together.

- 50 µl of competent XL1-Blue bacteria were transformed with 1/10th of the ligation products
according to the protocol described for piece 1.

- Direct colony PCR screening was performed using T3 and T7 primers.

Piece 8-gp150: The strategy for building that piece was identical to that of piece 8, except
that piece4-cl4M3 was used as vector (figure 18).

Piece 8-gp150/uncleaved: The strategy for building that piece is identical to that of piece 8,
except that piece4 gp160-cl4M3 is used as vector and a 200-bp XhoI/PstI fragment, obtained
from snut 1265-XhoI-gp160/uncleaved, as well as a 178-bp PstI/XbaI fragment, obtained from
snut 1465-PstI-CCG, are used as inserts.

Piece 8-gp140: The strategy for building that piece was identical to that of piece 8, except
that piece4-cl4M5 was used as vector (figure 19).

Piece 8-gp140/uncleaved: The strategy for building that piece is identical to that of piece 8,
except that piece4-cl4M5 is used as vector and a 200-bp XhoI/PstI fragment, obtained from
snut 1265-XhoI-gp160/uncleaved, as well as a 178-bp PstI/XbaI fragment, obtained from snut
1465-PstI-CCG, are used as inserts.

Piece 8-gp41: The strategy for building that piece is depicted in figure 20. RE digestion,
DNA fragment purification, ligation as well as direct PCR screening of recombinant colonies
were performed according to the same procedures described for the synBX08 gp160 gene. A
63-bp linker is to be made according to the method described for snut 2425-ES, the minigene
approach. Thus, 2 complementary oligonucleotides, 1265-gp41S
(5'-TCGAGgctagcGCCGTGGGCATCGGCGCTATGTTCCTCGGCTTCCTGGGCGctgca-3') and
1265-gp41AS (5'-gCGCCCAGGAAGCCGAGGAACATAGCGCCGATGCCCACGGCgctagcC-3'),
should be annealed together. This synthetic linker will be directly ligated into the
XhoI/PstI sites of piece8-cl13, from which the snut 1265-XhoI-cl2M1 would have been removed.

Piece 7: The strategy for building that piece is depicted in figure 21. RE digestion, DNA
fragment purification, ligation as well as direct PCR screening of recombinant colonies were
performed according to the same procedures described above for piece 1, except for the following:

- The plasmid piece5-cl1 was linearised by ClaI/XhoI and agarose gel purified, in order to be
used as vector.

- A 798-bp XhoI/SacII fragment, obtained from piece8-cl13, as well as a 140-bp SacII/ClaI
fragment, obtained from 2060-SacII-cl21, were agarose gel purified.

- The ligation of the 3 fragments was performed using a vector:insert ratio of 1:1, 1:2 or 1:5.

- 50 µl of competent XL1-Blue bacteria were transformed with 1/10th of the ligation products
according to the protocol described for piece 1.

- Direct colony PCR screening was performed using M13Reverse and T7 primers.

Example 5: assembly of genes

synBX08 gp160 gene: The strategy for building that gene is depicted in figure 6. RE
digestion, DNA fragment purification, ligation as well as direct PCR screening of
recombinant colonies were performed according to the same procedures described for piece 1.
20 µg of the expression plasmid WRG7079 were digested by NheI/BamHI. Plasmid DNA
ends were dephosphorylated by Calf Intestine Phosphatase treatment (CIP, Biolabs)
(Maniatis) to avoid self-ligation of any partially digested vector. CIP enzyme was
heat-inactivated and removed by classical phenol-chloroform extraction. A 1277-bp NheI/XhoI
fragment, obtained from piece3-cl27, as well as a 7794-bp XhoI/BamHI fragment, obtained
from piece7-cl1, were agarose gel purified. The ligation was performed using a vector:insert
ratio of 1:1 or 1:2. Fifty µl of competent XL1-Blue bacteria were transformed with 1/10th of the
ligation product according to the protocol described for piece 1. After transformation, bacteria
were plated on LB-kanamycin agar plates. Direct PCR colony screening was performed
using the primer set WRG-F (5'-AGACATAATAGCTGACAGAC-3') and WRG-R
(5'-GATTGTATTTCTGTCCCTCAC-3'). The nucleotide sequence was determined according to the
methods described above for piece 1.

synBX08 gp150 gene: The strategy for building that gene is depicted in figure 5. RE
digestion, DNA fragment purification, ligation as well as direct PCR screening of
recombinant colonies were performed according to the same procedures described for the
synBX08 gp160 gene. A 1277-bp NheI/XhoI fragment, obtained from piece3-cl27, as well as
an 800-bp XhoI/BamHI fragment, obtained from piece 8-gp150-cl26, were agarose gel purified
and then ligated into the NheI/BamHI sites of WRG7079. The ligation was performed using a
vector:insert ratio of 1:1 or 1:2.

For construction of the synthetic BX08 gp150, piece4 was mutated to piece4gp150, whereby
a tyrosine was changed to a cysteine and a stop codon was introduced after the
transmembrane spanning domain (TMD), followed by a BamHI cloning site. A new
piece8gp150 was constructed, composed of snut1265/snut1465/piece4gp150.

synBX08 gp150/uncleaved gene: RE digestion, DNA fragment purification, ligation as
well as direct PCR screening of recombinant colonies are performed according to the same
procedures described for the synBX08 gp160 gene. A 1277-bp NheI/XhoI fragment, obtained
from piece3-cl27, as well as an 800-bp XhoI/BamHI fragment, obtained from piece
8-gp150/uncleaved, are agarose gel purified and then ligated into the NheI/BamHI sites of
WRG7079. The ligation is performed using a vector:insert ratio of 1:1 or 1:2.

synBX08 gp140 gene: The strategy for building this gene is depicted in figure 4. RE
digestion, DNA fragment purification, ligation, and direct PCR screening of
recombinant colonies were performed according to the same procedures described for the
synBX08 gp160 gene. A 1277-bp NheI/XhoI fragment, obtained from piece3-cl27, as well as
a 647-bp XhoI/BamHI fragment, obtained from piece8-gp140-cl2, were agarose gel purified
and then ligated into the NheI/BamHI sites of WRG7079. The ligation was performed using a
vector:insert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp140, piece4 was
mutated to piece4gp140, whereby a stop codon was introduced just before the TMD, followed
by a BamHI cloning site. A new piece8gp140 was constructed, composed of
snut1265/snut1465/piece4gp140.



synBX08 gp140/uncleaved gene: RE digestion, DNA fragment purification, ligation, and
direct PCR screening of recombinant colonies are performed according to the same
procedures described for the synBX08 gp160 gene. A 1277-bp NheI/XhoI fragment, obtained
from piece3-cl27, as well as an 800-bp XhoI/BamHI fragment, obtained from
piece8-gp140/uncleaved, are agarose gel purified and then ligated into the NheI/BamHI
sites of WRG7079. The ligation is performed using a vector:insert ratio of 1:1 or 1:2.



synBX08 gp120 gene: The strategy for building this gene is depicted in figure 3. RE
digestion, DNA fragment purification, ligation, and direct PCR screening of
recombinant colonies were performed according to the same procedures described for the
synBX08 gp160 gene. A 1277-bp NheI/XhoI fragment, obtained from piece3-cl27, as well as
a 206-bp XhoI/BamHI fragment, obtained from 1265-XhoI-gp120-clM5, were agarose gel
purified and then ligated into the NheI/BamHI sites of WRG7079. The ligation was performed
using a vector:insert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp120, snut
1265 was mutated to S1265gp120 to introduce a stop codon at the gp120/gp41 cleavage site,
followed by a BamHI cloning site.

The gp160, gp150, gp140, and gp120 genes are cloned (NheI-BamHI) and maintained in a
eukaryotic expression vector containing a CMV promoter and a tPA leader, but other
expression vectors may be chosen based on other criteria, e.g. antibiotic resistance selection,
other leader sequences such as CD5, presence or absence of immune stimulatory sequences, etc.



synBX08 gp41 gene: The strategy for building this gene is depicted in figure 20. RE
digestion, DNA fragment purification, ligation, and direct PCR screening of
recombinant colonies were performed according to the same procedures described for the
synBX08 gp160 gene. Piece8-gp41 is ligated with snut2060-SacII-klon21 and piece5 as
already described for the construction of piece7, creating piece7-gp41 (P7gp41). Subsequently,
the piece7-gp41 containing the entire gp41 gene will be cloned in WRG7079 using the NheI
and BamHI sites.
Example 6a: High expression by codon optimization

To analyse the expression of glycoproteins from the wild type and synthetic BX08 envelope
genes, RIPA was performed on transfected mammalian cell lines. Both cell membrane
associated and secreted HIV-1 glycoproteins from the cell supernatants were assayed. The
envelope plasmids were transfected into the human embryonic kidney cell line 293 (ATCC,
Rockville, MD) or the mouse P815 (H-2Dd) cell line using calcium phosphate (CellPhect
Transfection kit, Pharmacia). For radio immune precipitation assay (RIPA), transfected cells
were incubated overnight, washed twice and incubated for 1 hour with DMEM lacking
cysteine and methionine (Gibco). Then the medium was replaced with medium containing 50
µCi per ml of [35S] cysteine and 50 µCi/ml of [35S] methionine (Amersham Int., Amersham,
UK) and incubation continued overnight. Cells were centrifuged, washed twice with HBSS
and lysed in 1 ml ice-cold RIPA buffer (10 mM Tris, pH 7.4, 150 mM NaCl, 50 mM EDTA, 1%
Nonidet P-40, 0.5% sodium deoxycholate) to detect membrane bound Env glycoproteins. The
cell lysates were centrifuged for 15 min. at 100,000 x g to remove any undissolved particles,
and 100 µl was immune precipitated with protein A-sepharose coupled human polyclonal IgG
anti-HIV antibodies (Nielsen et al., 1987). For analysis of secreted Env glycoproteins, 500 µl
of the 5 ml supernatants from transfected cells were incubated with protein A-sepharose
coupled human polyclonal IgG anti-HIV antibodies. After washing three times in cold RIPA
buffer and once in PBS, the immunoprecipitates were boiled for 4 min. in 0.05 M Tris-HCl,
pH 6.8, 2% SDS, 10% 2-mercaptoethanol, 10% sucrose, 0.01% bromophenol blue and
subjected to SDS-PAGE (Vinner et al., 1999). Electrophoresis was carried out at 80 mV for 1
hour in the stacking gel containing 10% acrylamide, and at 30 mV for 18 hours in the
separating gradient gel containing 5-15% acrylamide. Gels were fixed in 30% ethanol-10%
acetic acid for 1 hour, soaked for 30 min. in En3Hance (Dupont #NEF 981), washed 2 x 15
min. in distilled water, and dried, and autoradiography was performed on Kodak XAR-5 film.
Transfection of human 293 cells with the syn.gp120BX08 and syn.gp140BX08 genes,
respectively, resulted in high amounts of only secreted HIV-1 glycoproteins (Fig. 22a, lane 9
and 8). Thus, the synthetic gene in the absence of rev expresses the HIV-1 surface
glycoprotein of the expected size, which is recognised by human anti-HIV-1 antisera. The
expression of BX08 gp120 was Rev independent and in roughly the same high amount as
gp120 from the syn.gp120MN gene (Fig. 22a, lane 2). Fig. 22a, lane 6 and lane 7 show the
expression of only membrane bound gp160 and gp150 from 293 cells transfected with
syn.gp160BX08 and syn.gp150BX08 plasmids, respectively. Transfection with the wt.gp160BX08
plasmid also resulted in a significant expression of membrane bound gp160 despite the absence
of Rev (Fig. 22a, lane 3). Co-transfection with equimolar amounts of Rev encoding plasmid
seemed to increase this expression somewhat (Fig. 22a, lane 4). This is seen despite the
lower transfection efficiency using two plasmids and the use of only half the amount of
wt.gp160BX08 DNA when combined with pRev. The amounts of secreted HIV-1 glycoproteins
from gp120 and gp140 accumulating in the cell supernatants seemed higher than the
amounts of cell associated glycoproteins at the time of harvesting of the cells. Interestingly,
the amounts of gp160 produced from the "humanized" gene were about equal to the
amounts produced by the wt.gp160BX08 + pRev genes (Fig. 22a, lane 6 and 4, respectively).
The processing of gp160, gp150 and gp140 into gp120 plus gp41, or fractions of gp41,
produced from wild type or synthetic genes in the 293 cell line did not function well under
these experimental conditions. The same phenomenon was seen in RIPA from 293-CD4 cells
and HeLa-CD4 cells infected by HIV-1 mm (Vinner et al., 1999). Because of the absence of
CCR5 these cell lines could, however, not be infected by HIV-1 strain BX08.

Example 6b: Radio immuno precipitation assay (RIPA) of synthetic BX08 transfected 
cells showing expression of glycoproteins from synthetic BX08 env plasmids 

The synthetic envelope plasmid DNAs were transfected into the human embryonic kidney cell
line 293 (ATCC, Rockville, MD) using calcium phosphate (CellPhect Transfection kit,
Pharmacia). For immune precipitation analysis, transfected 293 cells were treated and
analysed according to the method described in example 6a.
To analyse expression from these genes, an SDS-PAGE of the 35S-labelled HIV-1
envelopes, immune precipitated from the transfected cells, is shown in figure 22b. Both
cell-membrane associated and secreted HIV-1 envelope glycoproteins in the cell supernatants
were assayed. Transfection of 293 cells with the synthetic BX08 gene encoding gp120
(syn.gp120BX08, lane 4) and syn.gp140BX08 (lane 5), which did not contain rev encoding
regions, resulted in abundant amounts of HIV-1 gp120. Thus, the expression was Rev
independent and roughly as high as that of the syn.gp120MN and
syn.gp160MN genes (lane 3 and 2, respectively), already shown by our group and others to
be markedly increased in comparison with HIV MN wild type genes including rev (Vinner et
al 1999).

Transfection with syn.gp150 plasmid (lanes 7 and 8) resulted in significant expression of
membrane associated gp120 and low detectable amounts of a truncated form of gp41 (cell
pellet in lane 7), with no detectable HIV-1 glycoprotein in the cell supernatant (lane 8). It is
concluded that the synthetic BX08 genes express the envelope glycoproteins of expected size,
which are recognised by human anti-HIV-1 antiserum.
Example 6c: FACS

To quantitate the surface expression of HIV glycoproteins from the wild type and synthetic
BX08 envelope genes, transfection experiments were done and cell surface expression was
examined by FACS (flow cytometer).

10 µg of the BX08 envelope plasmid (wild type BX08gp160 or synBX08gp160) plus 10 µg of
an irrelevant carrier plasmid, pBluescript, were used to transfect an 80-90% confluent layer of
293 cells in tissue culture wells (25 cm²) using the CellPhect kit (Pharmacia). After 48 hours
cells were Versene treated, washed and incubated with a mouse monoclonal IgG antibody
to HIV gp120 (NEA-9301, NEN™ Life Science Products Inc., Boston) for 30 min. on wet
ice, followed by washing in PBS, 3% FCS and incubation with phycoerythrin (PE) labelled rat
anti-mouse IgG1 (Cat #346270, Becton Dickinson) according to the manufacturer. After
washing, the cells were fixed in PBS, 1% paraformaldehyde, 3% FCS, and analysed on a
FACS (FACScan, Becton Dickinson). Table 5 shows, in duplicate, expression of BX08 gp160
from 11% of the cells transfected with wild type BX08 (numbers 3 and 4) compared to 48%
of cells expressing BX08 glycoprotein when transfected with the synthetic gene (numbers 1
and 2). Thus, a several fold higher expression is obtained using the synthetic BX08 gene.

Table 5 FACS analysis of 293 cells transfected with synBX08gp160 (No 1 and 2) and
wt.gp160BX08 (No 3 and 4) and stained with monoclonal antibodies to surface expressed
HIV glycoproteins. A higher expression was obtained with the synthetic gene (mean 48%) as
compared to the wild type gene (mean 11%).

No  50 µl              45 µl              A      B      C       C-A     C-B
1   syn.gp160BX08 +    pBluescript SK+    2,57   2,85   36,91   34,34   34,06
2   syn.gp160BX08 +    pBluescript SK+    2,83   2,14   58,42   55,59   56,28
3   wt.gp160BX08 +     pBluescript SK+    1,95   1,52    7,51    5,56    5,99
4   wt.gp160BX08 +     pBluescript SK+    2,97   1,42   14,41   11,44   12,99

A: No primary antibody added (control for unspecific secondary Ab binding)
B: Neither primary Ab nor secondary Ab added (autofluorescence control)
C: Primary Ab and secondary Ab added.
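The derived columns and the quoted means of Table 5 can be recomputed directly from columns A, B and C. The following minimal sketch does so; the numbers are copied from the table (decimal commas kept as strings and parsed), while the function and variable names are illustrative only.

```python
def parse(value):
    """Convert a decimal-comma string such as '36,91' to a float."""
    return float(value.replace(",", "."))


# (sample number, construct, A, B, C) copied from Table 5
ROWS = [
    (1, "syn", "2,57", "2,85", "36,91"),
    (2, "syn", "2,83", "2,14", "58,42"),
    (3, "wt", "1,95", "1,52", "7,51"),
    (4, "wt", "2,97", "1,42", "14,41"),
]


def summarise(rows):
    """Mean of C, C-A and C-B for each construct."""
    grouped = {}
    for _, construct, a, b, c in rows:
        a, b, c = parse(a), parse(b), parse(c)
        cols = grouped.setdefault(construct, {"C": [], "C-A": [], "C-B": []})
        cols["C"].append(c)
        cols["C-A"].append(c - a)   # corrected for unspecific secondary Ab binding
        cols["C-B"].append(c - b)   # corrected for autofluorescence
    return {
        construct: {name: sum(vals) / len(vals) for name, vals in cols.items()}
        for construct, cols in grouped.items()
    }


means = summarise(ROWS)
for construct, cols in means.items():
    print(construct, {name: round(val, 2) for name, val in cols.items()})
print("fold increase (column C):", round(means["syn"]["C"] / means["wt"]["C"], 1))
```

The rounded means of column C reproduce the 48% and 11% quoted in the caption, which suggests that the caption refers to the uncorrected column C rather than to the background-subtracted columns.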



Example 6d: Analyses of the surface expression and biological functionality

To analyse the surface expression and biological functionality of the wild type and
synthetic BX08 envelope genes, transfection experiments were done and cell fusion was
studied microscopically using HIV envelope receptor expressing cells.
10 µg of the BX08 envelope plasmid (wt.BX08gp160 or syn.BX08gp160 or empty WRG7079
vector plasmid) plus 5 µg of a plasmid (pEGFP, Clontech) expressing green fluorescent
protein (GFP) were transfected into 2 x 10^5 adherent U87.CD4.CCR5 cells (NIH AIDS Res. &
Reference program, catalog #4035) stably expressing CD4 and CCR5, using the CellPhect
transfection kit (Pharmacia). After 48 hours the cells were examined by microscopy and
photographed (Fig 22c).

Fig 22c panel A shows the negative control (empty WRG7079 plus pEGFP) giving no syncytia.
Panel B shows cells transfected with the wild type BX08 gp160 plasmid, where cell-to-cell
fusion (syncytia) is seen. Panel C shows cells transfected with the same amount of
synBX08gp160 plasmid, demonstrating a much higher degree of cell-cell fusion. In fact
most or all of the cells in the culture plate were fused at this time. This experiment shows
surface expression of functional HIV gp160 with tropism to the CCR5 receptor, as well as a
much higher expression and biological activity from the synthetic BX08 gene as compared to
the wild type equivalent.


Example 7: Gene inoculation of mice for immunization 

6-7 week old female BALB/c mice were purchased from Bomholdtgaard, Denmark.
Microbiological status was conventional and the mice were maintained in groups of 4/5 per
cage with food and water ad libitum and artificially lighted 12 hours per day. Acclimatization
period was 2 days. Mice were anaesthetized with 0.2 ml i.p. of rohypnol:stesolid (1:3, v/v)
and DNA inoculated by either i.m. injection of 50 µl of 2 mg/ml plasmid DNA in each tibialis
anterior muscle at week 0, 9, and 15 and terminated at week 18; or gene gun inoculated on
shaved abdominal skin using plasmid coated gold particles (0.95 µm particles, 2 µg DNA/mg
gold, 0.5 mg gold/shot, 50-71% coating efficiency) with the hand held Helios® gene gun
device (BioRad) employing compressed (400 psi) helium as the particle motive force. Mice
were gene gun vaccinated at week 0, 3, 6, 9, and 15, and terminated at week 18.



Example 8: Serological assays 

Western blotting. The induction of a humoral response to gp120 and gp41 antigens by in
vivo expression of the encoded glycoproteins from the synthetic BX08 genes was examined
by western immunoblotting (Figure 27). Mouse antisera (1:40) were evaluated in western
blotting using the commercial HIV BLOT 2.2 strips (Genelabs Diagnostic). The conjugate
was a 1:200 dilution of the alkaline phosphatase-conjugated rabbit anti-mouse IgG
(Dakopatts, Glostrup, Denmark). Buffers, incubation conditions and colour development were
used according to the manufacturer. In these western blotting strips the gp160 band from
HIV-1 IIIB consists of an oligomeric form of gp41 in a higher concentration than the
monomeric gp41 band on the strip (Genelabs Diagnostic). HIV-1 IIIB lysate is used in these
commercial strips, where the gp160 band is composed by addition of tetrameric gp41. All
preimmune sera tested negative in western blotting. Mice inoculated with syn.gp120BX08
showed antibody response to the heterologous gp120 of HIV-1 IIIB. Inclusion of the
extracellular part of gp41 in the gene syn.gp140BX08 induced antibody reaction to both gp120
and gp41 in all mice. This confirms the in vivo expression of BX08 gp120 and the
extracellular part of gp41. DNA vaccination with syn.gp160BX08 encoding the membrane
bound glycoprotein induced antibodies to gp120 and gp41 in 50% and 64% of the mice,
respectively. DNA vaccination with syn.gp150BX08 induced detectable antibodies to gp120
and gp41 in 41% and 53%, respectively. Induction of different levels of antibodies could
explain the difference in numbers of positive reactive mice sera in this qualitative western
blotting.

ELISA. Mouse anti-HIV-1 gp120 antibodies were measured by indirect ELISA. Briefly, wells
of polystyrene plates Maxisorb (Nunc) were coated for 2 days at room temperature with
HIV-1 IIIB recombinant gp120 (Intracel) at 0.2 µg/100 µl of carbonate buffer, pH 9.6. Before use
the plates were blocked 1 hour at room temperature with 150 µl/well of washing buffer (PBS,
0.5 M NaCl, 1% Triton-X-100) plus 2% BSA and 2% skim milk powder. After 3 x 1 min.
washings, mouse plasma was added at 100 µl/well diluted in blocking buffer and ELISA
plates incubated for 90 min. at room temperature using a microtiter plate shaker. As standard
curve we used a mouse monoclonal antibody to a conserved part of gp120 between V5-C5
(MRDNWRSELYKY) (#NEA-9301, NEN™ Life Science Products, Inc., Boston, MA). As
calibration control included on each plate we used a plasma pool from 10 mice vaccinated
with BX08 gp120. Plates were again washed 5 x 1 min. and incubated 1 hour at room
temperature with 100 µl/well of HRP-conjugated rabbit anti-mouse IgG (#P260, Dakopatts,
Glostrup, Denmark) diluted 1:1000 in blocking buffer. Colour was developed with 100 µl/well
of peroxidase enzyme substrate consisting of 4 mg of o-phenylenediamine in 11 ml water
plus 4 µl hydrogen peroxide (30%, w/w). The enzyme reaction was terminated after 30 min.
by 150 µl/well of 1 M H2SO4. The optical density (OD) of wells was measured at 492 nm using
a microplate photometer (Molecular Devices, Biotech-Line, Denmark). Anti-HIV-gp120 IgG
titers were expressed as the reciprocal plasma dilution resulting in an OD492nm value of 0.500.
Mouse anti-HIV-1 BX08 antibodies were also measured by indirect peptide ELISAs as
described above using a BX08 V3 peptide (SIHIGPGRAFYTTGD) (Schafer, Copenhagen,
Denmark).
The IgG antibody response to HIV-1 IIIB rgp120 quantitated by ELISA is seen in Fig. 28
and Fig. 29. No background activity was observed in preimmune sera or in sera from 4 mice
immunized with empty WRG7079 vector in parallel with the BX08 genes. All mice inoculated
with the synthetic BX08 genes either by gene gun or by i.m. injection responded and showed
a persistent and high titered (about 100-10,000) IgG response to rgp120 as exemplified in
Fig. 4. When comparing the median titers for groups of mice (Fig. 29), a moderate antibody
response was observed with the wt.gp160BX08. Intramuscular and gene gun immunization
with a mixture of wt.gp160BX08 plasmid plus Rev encoding plasmid did not increase this
antibody response. This was found even when both plasmids were coated onto the same
gold particles to ensure co-transfection of single target cells. However, to ensure inoculation
of equal amounts of total DNA, only half of the amount of wt.BX08 plasmid was used when
mixing with pRev, which may have contributed to the lower antibody response when pRev
was included. A 5-fold improvement of the antibody response was obtained using the
syn.gp160BX08 gene. This antibody response seemed further improved using the
syn.gp150BX08 gene, where the cytoplasmic internalization signals were eliminated, but only
using gene gun inoculation. For both the gene gun inoculation of skin and i.m. injection the
highest antibody titers to rgp120 were induced by genes encoding secreted gp120/gp140
glycoproteins versus membrane bound gp150/gp160 glycoproteins, respectively. In general,
equal antibody and ELISA titers to rgp120 were obtained using gene gun and i.m. injection of
the BX08 vaccine genes.

Example 9: Neutralization assay 

Mouse plasma was diluted in culture medium (RPMI-1640 medium (Gibco) supplemented
with antibiotics (Gibco), Nystatin (Gibco) and 10% FCS (Bodinco)) and heat inactivated at
60°C for 30 min. Of the HIV-1 strain BX08 (50 TCID50 per ml propagated in PBMC), 250 µl
was incubated for 1 hour at room temperature with 250 µl dilution of mouse serum (four
five-fold dilutions of mouse serum, final dilutions 1:20 to 1:2500). After incubation, 1 x 10^6 PBMC
in 500 µl culture medium was added to the virus-serum mixture and incubated overnight at
37°C in 5% CO2. Subsequently, eight replicates of 10^5 PBMC in 200 µl culture medium were
cultured in 96-well culture plates (Nunc) at 37°C in 5% CO2. After seven days in culture the
concentration of HIV antigen in the culture supernatant was quantitated using HIV antigen
detection ELISA (Nielsen et al., 1987).

This ELISA is performed using human IgG, purified from high titered patient sera, both as
capture antibody and, biotin-linked, as detecting antibody. In brief, anti-HIV-capture IgG diluted
1:4000 in PBS, 100 µl/well, is coated onto Immunoplates (Nunc) overnight at 4°C. After
washing five times in washing buffer, 100 µl of supernatants are applied and incubated overnight
at 4°C. Plates are washed 5 times before incubation with 100 µl HIV-IgG conjugated with biotin
diluted 1:1000 in dilution buffer, plus 10% HIV-1 sero-negative human plasma, for 3 hours at
room temperature. Five 1 min. washings in washing buffer are followed by 30 min.
incubation with 100 µl of 1:1000 avidin-peroxidase (Dako P347 diluted in dilution buffer). Six
1 min. washings, 5 in washing buffer and the last one in dH2O, are done before colour is
developed with 100 µl of peroxidase enzyme substrate consisting of 4 mg of OPD in 11 ml
water plus 4 µl hydrogen peroxide (30%, w/w). The enzyme reaction is terminated after 30
minutes by addition of 150 µl of 1 M H2SO4.

The HIV antigen concentration in cultures preincubated with mouse serum was expressed
relative to cultures without mouse serum (culture medium), and the percentage inhibitions
of the different dilutions of mouse serum were calculated. The 50% inhibitory concentration
(IC50) for each mouse serum was determined by interpolation from the plots of percent
inhibition versus the dilution of serum, and the neutralizing titer of the serum was expressed
as the reciprocal value of the IC50. In each set-up a human serum pool known to neutralise
other HIV-1 strains was included in the same dilutions as the mouse serum as a calibration
control. For assay of neutralization of the heterologous SHIV89.6P the MT-2-cell-killing
format was used (Crawford et al., 1999). The assay stock of SHIV89.6P was grown in human
PBMC.
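The titer calculation described above can be sketched as follows. This is a minimal illustration, assuming linear interpolation on a log10 dilution scale between the two dilutions that bracket 50% inhibition (the text says only "interpolation"). The four reciprocal dilutions are the five-fold series given above; the antigen readings and the function names are hypothetical.

```python
import math


def percent_inhibition(antigen_with_serum, antigen_medium_only):
    """Inhibition relative to the culture without mouse serum."""
    return 100.0 * (1.0 - antigen_with_serum / antigen_medium_only)


def neutralizing_titer(reciprocal_dilutions, inhibitions, level=50.0):
    """Reciprocal serum dilution giving `level` percent inhibition.

    Returns None when the inhibition curve never crosses `level`.
    """
    points = sorted(zip(reciprocal_dilutions, inhibitions))
    for (d1, i1), (d2, i2) in zip(points, points[1:]):
        if i1 >= level >= i2 and i1 != i2:
            frac = (i1 - level) / (i1 - i2)
            log_d = math.log10(d1) + frac * (math.log10(d2) - math.log10(d1))
            return 10 ** log_d
    return None


DILUTIONS = [20, 100, 500, 2500]             # final dilutions 1:20 to 1:2500
antigen = [0.10, 0.30, 0.70, 0.95]           # hypothetical readings, control = 1.0
inhibitions = [percent_inhibition(a, 1.0) for a in antigen]
titer = neutralizing_titer(DILUTIONS, inhibitions)
print(f"neutralizing titer ~ 1:{titer:.0f}")
```

A serum whose inhibition stays below 50% at every dilution yields `None` here, which corresponds to a titer below the lowest dilution tested.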

The neutralizing IC50 antibody titers of plasma pools from 10 mice from each group
were measured at different time points (week 0, 9, and 18). A positive background in some
preimmune sera, and thus in all week 0 serum pools, was noted even after dilution and heat
inactivation, which had earlier been found to lower this background. In general the neutralizing
titers to BX08 virus of such serum pools were transient and low, ranging from 1:6 to 1:150 above
background (data not shown). A possible cross-neutralization reaction to a heterologous,
primary HIV-1 envelope was tested using the SHIV89.6P, which is relevant in macaque
models of AIDS, and serum pools from mice DNA immunized i.m. with syn.gp140BX08.
Preimmune serum had a titer of 1:37, which is indicative of a slightly positive background,
whereas the 18 week p.i. serum had a positive neutralizing titer of 1:254 above background.

Example 10: CTL assay

The cellular immune response in mice following gene gun or i.m. genetic immunization with
the different vaccine plasmids was examined (Fig. 26). Spleens were removed aseptically and
gently homogenised to single cell suspension, washed 3 times in RPMI-1640 supplemented
with 10% FCS and resuspended to a final concentration of 5 x 10^7 cells/ml. The cells were
then incubated 5 days with mitomycin-C treated (50 ng/ml for 1 hour) mouse P815 (H-2Dd)
stimulator cells at a ratio of 10:1 in medium supplemented with 5 x 10^-5 M
β-mercaptoethanol. For assay of CTL response to HIV-1 BX08, P815 stimulator cells and
target cells were pulsed with 20 ng/ml of the HIV-1 BX08 V3 peptide containing a conserved
murine H-2Dd restricted CTL epitope (IGPGRAFYTT) (Lapham et al., 1996). After
stimulation, splenocytes were washed three times with RPMI-1640 supplemented with 10%
FCS and resuspended to a final concentration of 5 x 10^6 cells/ml. 100 µl of cell suspension
was added in triplicate to U-bottom 96-well microtiter plates and a standard 4 hour
51Cr-release assay was performed (Marker et al., 1973).
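The text does not spell out how the 51Cr-release readings were converted to a CTL response. A minimal sketch of the conventional calculation is given here as an assumption: percent specific lysis is the experimental release minus the spontaneous release, divided by the maximum release minus the spontaneous release. The counts and the function name are hypothetical, and triplicate wells are averaged as in the plate layout described above.

```python
from statistics import mean


def percent_specific_lysis(experimental_cpm, spontaneous_cpm, maximum_cpm):
    """Conventional 51Cr-release calculation from replicate wells (cpm)."""
    experimental = mean(experimental_cpm)
    spontaneous = mean(spontaneous_cpm)   # target cells in medium only
    maximum = mean(maximum_cpm)           # target cells fully lysed
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


# Hypothetical triplicate counts per minute for one effector:target ratio.
lysis = percent_specific_lysis(
    experimental_cpm=[1500, 1600, 1700],
    spontaneous_cpm=[400, 400, 400],
    maximum_cpm=[2800, 2800, 2800],
)
print(f"specific lysis: {lysis:.1f}%")
```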

All synthetic BX08 plasmids induced a high specific CTL response, thus confirming the in vivo
expression and in vivo immunogenicity. The highest CTL response was obtained with
syn.gp150BX08 followed by syn.gp120BX08, syn.gp140BX08, and syn.gp160BX08, respectively.
Thus, the CTL response induced did not correlate with the antigen being secreted or not.
However, i.m. DNA immunization with syn.gp150BX08 containing six putative CpG motifs
induced a higher CTL response than gene gun immunization (Fig. 26). This difference could
be explained by the high amount of DNA used in the i.m. injections.

The T-lymphocyte cytokine profile of spleen cells after ConA stimulation as well as the serum
antibody IgG2a/IgG1 ratio at week 18 were investigated. Neither the IFNγ/IL-4 nor the IgG2a/IgG1
ratios, which both reflect a Th1-type of immune response, were significantly higher for the
i.m. immunized mice when compared with gene gun immunized mice (Student's t-test and
Mann-Whitney U-test). Thus, the CTL response did not correlate with a certain Th-type of
response and the DNA immunization technique did not bias the immune response using
synthetic BX08 genes.

Example 11: Antibody responses to DNA vaccination with synBX08 env plasmid 

A relatively low and variable antibody response (1 of 10 mice) was obtained with gene gun
inoculation of the syn.gp140BX08 plasmid vaccine starting at week 9 (figure 23, right panel).
A higher number of responders (3/10) with high IgG1 antibody responses at an earlier onset
(week 3-9) was obtained with the syn.gp140BX08 plasmid using i.m. injection (left panel).
Sera from later time points may show more responders and/or higher titers but were not
assayed. However, these results show the induction of an antibody response to the BX08 V3
peptide by DNA vaccination using one of the described synthetic BX08 constructs.
Claims

1. A method for producing a nucleotide sequence construct comprising the following steps:

a) obtaining a first nucleotide sequence of an HIV gene from a patient within the first 12
months of infection;

b) designing a second nucleotide sequence utilising the most frequent codons from
mammalian highly expressed proteins to encode the same amino acid sequence as the
first nucleotide sequence of a) encodes;

c) redesigning the second nucleotide sequence of b) so that restriction enzyme sites
surround the regions of the nucleotide sequence which encode functional regions of the
amino acid sequence and so that selected restriction enzyme sites are removed, thereby
obtaining a third nucleotide sequence encoding the same amino acid sequence as the
first and the second nucleotide sequence of a) and b) encode;

d) redesigning the third nucleotide sequence of c) so that the terminal snuts contain
convenient restriction enzyme sites for cloning into an expression vehicle;

e) producing the snuts between restriction enzyme sites of c) and terminal snuts of d);

f) assembling the snuts of step e) to form a nucleotide sequence construct.

2. A method according to claim 1, wherein the HIV gene is the gene encoding the envelope.

3. A method according to claim 1 or 2, wherein the HIV gene encodes one or more Gag
proteins.

4. A method according to any of the preceding claims, wherein the HIV in step a) is in group
M, O or N.

5. A method according to claim 4, wherein the HIV is a group M virus.

6. A method according to any of the preceding claims, wherein the HIV is subtype A, B, C, D,
E, F, G, H, I, or J.

7. A method according to claim 6, wherein the HIV is subtype B.

8. A method according to any of the preceding claims wherein the first nucleotide sequence
is obtained by direct cloning.
9. A method according to any of the preceding claims, wherein the HIV in step a) is isolated
within the first 11 months of infection, such as 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, or 0.5 months after
infection.

10. A method according to any of the preceding claims, wherein the redesigning in step c) is
carried out after the second nucleotide sequence of step b) has been divided into pieces,
so that each piece comprises only different restriction enzyme sites.

11. A method according to claim 10, wherein the second nucleotide sequence of step b) is
divided into 9 pieces, or 8, or 7, or 6, or 5, or 4, or 3, or 2 pieces.

12. A method according to claim 11, wherein the second nucleotide sequence of step b) is
divided into 3 pieces.

13. A method according to any of the preceding claims, wherein the second nucleotide 
sequence of step b) is designed utilising the most frequent codons from human highly 
expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence of step a) encodes. 

14. A nucleotide sequence construct obtainable by the method of any of claims 1-13. 

15. A nucleotide sequence construct according to claim 14, wherein the nucleotide sequence
encoding the amino acid sequence in the first variable region is surrounded by EcoRV
and PstI restriction enzyme sites.

16. A nucleotide sequence construct according to claims 14 or 15, wherein the nucleotide
sequence encoding the amino acid sequence in the second variable region is surrounded
by PstI and ClaI restriction enzyme sites.

17. A nucleotide sequence construct according to any of claims 14-16, wherein the
nucleotide sequence encoding the amino acid sequence in the third variable region is
surrounded by ClaI and EcoRI restriction enzyme sites.
18. A nucleotide sequence construct according to any of claims 14-17, wherein the
nucleotide sequence encoding the amino acid sequence in the transmembrane spanning
region is surrounded by HindIII and SacII restriction enzyme sites.

19. A nucleotide sequence construct according to any of claims 14-18, wherein the
nucleotide sequence encoding the amino acid sequence on both sides of the cleavage site
is surrounded by PstI and XbaI restriction enzyme sites.

20. A nucleotide sequence construct in isolated form which has a nucleotide sequence with
the general formula (I), (II), (III), or (IV) or subsequences thereof

(I) P1-S495ClaI-S650-720EcoRI-P2-S1265gp120

(II) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp140

(III) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp150

(IV) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp160-S2060SacII-P5

wherein P1 designates the nucleotide sequence SEQ ID NO: 41, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90%
thereto;

wherein S495ClaI designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least
95% thereto;

wherein S650-720EcoRI designates the nucleotide sequence SEQ ID NO: 9, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 95% thereto;

wherein P2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein S1265gp120 designates the nucleotide sequence SEQ ID NO: 19, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 70% thereto;

wherein S1265XhoI designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80%
thereto;

wherein S1465PstI designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90%
thereto;

wherein P4gp140 designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein P4gp150 designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein P4gp160 designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto;

wherein S2060SacII designates the nucleotide sequence SEQ ID NO: 33, a nucleotide
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at
least 98% thereto; and

wherein P5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85%
thereto.

21. A nucleotide sequence construct according to claim 20, with the formula (I)

(I) P1-S495ClaI-S650-720EcoRI-P2-S1265gp120

22. A nucleotide sequence construct according to claim 20, with the formula (II)

(II) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp140

23. A nucleotide sequence construct according to claim 20, with the formula (III)

(III) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp150

24. A nucleotide sequence construct according to claim 20, with the formula (IV)

(IV) P1-S495ClaI-S650-720EcoRI-P2-S1265XhoI-S1465PstI-P4gp160-S2060SacII-P5

25. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P1.

26. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S495ClaI.

27. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S650-720EcoRI.

28. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P2.

29. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S1265gp120.

30. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S1265XhoI.

31. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S1465PstI.

32. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P4gp140.

33. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P4gp150.

34. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P4gp160.

35. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence S2060SacII.

36. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequence P5.

37. A nucleotide sequence construct with a sequence identity of more than 85% to the
nucleotide sequence construct in any of claims 20-35.

38. A nucleotide sequence construct according to claim 37, wherein the sequence identity is
more than 90%, such as more than 95%, 98%, or 99%.

39. A nucleotide sequence construct according to claim 37, wherein the sequence identity is
100%.

40. A nucleotide sequence construct according to any of claims 14-39, coding for an HIV
envelope or parts thereof with an improved immunogenicity obtained by mutating the
nucleotide sequence construct of any of claims 14-39 such that one or more glycosylation
sites in the amino acid sequence have been removed.

41. A nucleotide sequence construct according to claim 40 with a mutation at positions
A307C + C309A and/or A325C + C327G and/or A340C + C342A and/or A385C + C387A
and/or A469C + C471A or any combination of those.

42. A nucleotide sequence construct according to any of claims 14-41, coding for an HIV
envelope or parts thereof with a binding site for the CXCR4 co-receptor in the third
variable region.

43. A nucleotide sequence construct according to claim 42 with a mutation at positions
G865C + A866G.

44. A nucleotide sequence construct according to any of claims 14-43, coding for an HIV
envelope or parts thereof, wherein an immunodominant epitope has been modified.

45. A nucleotide sequence construct according to claim 44, wherein an immunodominant
epitope in the third variable region has been modified.

46. A nucleotide sequence construct according to claim 45 with a deletion of nucleotides
793-897.

47. A nucleotide sequence construct according to claim 44, wherein an immunodominant
epitope has been removed from gp41.

48. A nucleotide sequence construct according to any of claims 14-47, coding for an HIV
envelope or parts thereof, wherein the cleavage site between gp41 and gp120 is
removed.

49. A nucleotide sequence construct according to claim 48 with a mutation at position
C1423A.
50. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequences S495ClaI, S650-720EcoRI, and P2.

51. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequences S1265XhoI, S1465PstI, and P4gp140.

52. A nucleotide sequence construct according to claim 20 consisting essentially of the
subsequences S1265XhoI, S1465PstI, P4gp160, S2060SacII, and P5.

53. A nucleotide sequence construct according to any of claims 14-52, further comprising a
nucleotide sequence repeat coding for a functional region of the amino acid sequence.

54. A nucleotide sequence construct according to claim 53, wherein the nucleotide sequence
repeat codes for amino acids in the third variable region.

55. A nucleotide sequence construct according to any of claims 14-54, further comprising a
nucleotide sequence coding for a T-helper cell epitope containing sequence.

56. An expression vehicle selected from a group of viral vectors consisting of Semliki Forest
virus (SFV), adenovirus and Modified Vaccinia Virus Ankara (MVA), further comprising a
nucleotide sequence construct according to any of claims 14-55.

57. A method of individualised immunotherapy wherein the virus from a newly diagnosed
patient is directly cloned, the envelope is produced with highly expressed codons, inserted
into any of the nucleotide sequence constructs of claims 14-55, and administered to the
patient.

58. Use of a nucleotide sequence construct according to any of claims 14-55 in medicine.

59. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a
vaccine for the prophylaxis of infection with HIV in humans.

60. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a
composition for the treatment of an HIV infection in a human within 24 weeks of primary
infection.



WO 00/29561 PCT/DK00/00144 


61. Use of the nucleotide sequence according to any of claims 14-55 for the production of a
recombinant protein.
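
Claims 20 and 37-39 are framed in terms of percentage sequence identity. As a minimal sketch only, and assuming two nucleotide sequences that have already been aligned to equal length (no alignment procedure is prescribed in these claims), identity can be counted position by position. The function name `percent_identity` and the example strings are hypothetical illustrations and are not taken from the application.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two pre-aligned sequences agree."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Compare case-insensitively, one aligned position at a time.
    matches = sum(a == b for a, b in zip(seq_a.lower(), seq_b.lower()))
    return 100.0 * matches / len(seq_a)


# Hypothetical 10-nucleotide example with a single mismatch at the last position.
reference = "acgtacgtac"
variant = "acgtacgtaa"
print(percent_identity(reference, variant))
```

A real comparison of two constructs of different length would first need an alignment step, which this sketch deliberately leaves out.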
Fig. 4
Fig. 6
Fig. 22 A
Fig. 22 C
Fig. 24
CTL activity induced by different synthetic DNA vaccines
Fig. 28 A: Syn.gp150BX08 i.m. (plot not reproduced; logarithmic vertical axis, 10 to 100000; horizontal axis, weeks)
Fig. 28 B: Syn.gp150BX08 g.g. (plot not reproduced; logarithmic vertical axis, 10 to 100000; horizontal axis, weeks)

SEQUENCE LISTING
<110> Statens Serum Institut 

<120> Method for producing a nucleotide sequence construct with optimised codons
for an HIV envelope based on a primary, clinical HIV isolate and the BX08 construct.

<130> 21924DK1 

<160> 76 

<170> FastSEQ for Windows Version 3.0 

<210> 1
<211> 243
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(243)

<400> 1
gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc 243
Ile



<210> 2
<211> 81
<212> PRT
<213> Artificial Sequence

<400> 2
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile
1 5 10 15

gta cgc cct gtt cta cag cct gga cat cgt gcc cat cga caa cga caa 96
Val Arg Pro Val Leu Gln Pro Gly His Arg Ala His Arg Gln Arg Gln
20 25 30

cac cag cta ccg cct gcg cag ctg caa cac atc gat 132
His Gln Leu Pro Pro Ala Gln Leu Gln His Ile Asp
35 40



<210> 6
<211> 44
<212> PRT
<213> Artificial Sequence

<400> 6
Leu Gln Leu Gln His Gln His Gln Arg Ala Gln Gln Asp Glu Ala Arg
1 5 10 15

Val Arg Pro Val Leu Gln Pro Gly His Arg Ala His Arg Gln Arg Gln
20 25 30

His Gln Leu Pro Pro Ala Gln Leu Gln His Ile Asp
35 40

<210> 7
<211> 161
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(161)

<400> 7
atc gat cat cac cca ggc ctg ccc caa ggt gag ctt cga gcc cat ccc 48
Ile Asp His His Pro Gly Leu Pro Gln Gly Glu Leu Arg Ala His Pro
1 5 10 15

cat cca ctt ctg cgc ccc cgc cgg ctt cgc cat cct gaa gtg caa caa 96
His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gln Gln
20 25 30

caa gac ctt caa cgg cac cgg ccc ctg cac caa cgt gag cac cgt gca 144
Gln Asp Leu Gln Arg His Arg Pro Leu His Gln Arg Glu His Arg Ala
35 40 45

gtg cac cca cgg aat tc 161
Val His Pro Arg Asn
50



<210> 8
<211> 53
<212> PRT
<213> Artificial Sequence

<400> 8
Ile Asp His His Pro Gly Leu Pro Gln Gly Glu Leu Arg Ala His Pro
1 5 10 15

His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gln Gln
20 25 30

Gln Asp Leu Gln Arg His Arg Pro Leu His Gln Arg Glu His Arg Ala
35 40 45

Val His Pro Arg Asn
50
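
The fragment names used in the claims (S495ClaI, S650-720EcoRI and so on) refer to the restriction sites that delimit each cassette. The following is a minimal sketch, not the applicant's procedure, of how such sites could be located in a fragment. The recognition sequences are the standard ones for the named enzymes; the function name `find_sites` and the short toy fragment are hypothetical. Because all six sites are palindromic, scanning one strand is sufficient.

```python
# Standard recognition sequences (5'->3') of the enzymes named in the claims.
RECOGNITION_SITES = {
    "ClaI": "atcgat",
    "EcoRI": "gaattc",
    "XbaI": "tctaga",
    "PstI": "ctgcag",
    "XhoI": "ctcgag",
    "SacII": "ccgcgg",
}


def find_sites(fragment: str) -> dict:
    """Return {enzyme: [0-based start positions]} for every site found."""
    seq = "".join(fragment.lower().split())  # remove codon spacing
    hits = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


# Hypothetical toy fragment: a ClaI site at the start, an EcoRI site at the end.
toy_fragment = "atc gat cat cac cca cgg aat tc"
print(find_sites(toy_fragment))
```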



<210> 9
<211> 254
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(254)

<400> 9
gaa ttc gcc ccg tgg tga gca ccc agc tgc tgc tga acg gca gcc tgg 48
Glu Phe Ala Pro Trp * Ala Pro Ser Cys Cys * Thr Ala Ala Trp
1 5 10

ccg agg agg agg tgg tga tca gat ctg aga act tca cca aca acg cca 96
Pro Arg Arg Arg Trp * Ser Asp Leu Arg Thr Ser Pro Thr Thr Pro
15 20 25

aga cca tca tcg tgc agc tga acg aga gcg tgg aga tca act gca ccc 144
Arg Pro Ser Ser Cys Ser * Thr Arg Ala Trp Arg Ser Thr Ala Pro
30 35 40

gcc cca aca aca aca ccc gca aga gca tcc aca tcg gcc ctg gcc gcg 192
Ala Pro Thr Thr Thr Pro Ala Arg Ala Ser Thr Ser Ala Leu Ala Ala
45 50 55 60

cct tct aca cca ccg gcg aca tca tcg gcg aca tcc gcc agg ccc act 240
Pro Ser Thr Pro Pro Ala Thr Ser Ser Ala Thr Ser Ala Arg Pro Thr
65 70 75

gca aca tct cta ga 254
Ala Thr Ser Leu
80
<210> 10
<211> 80
<212> PRT
<213> Artificial Sequence

<400> 10
Glu Phe Ala Pro Trp Ala Pro Ser Cys Cys Thr Ala Ala Trp Pro Arg
1 5 10 15

Arg Arg Trp Ser Asp Leu Arg Thr Ser Pro Thr Thr Pro Arg Pro Ser
20 25 30

Ser Cys Ser Thr Arg Ala Trp Arg Ser Thr Ala Pro Ala Pro Thr Thr
35 40 45

Thr Pro Ala Arg Ala Ser Thr Ser Ala Leu Ala Ala Pro Ser Thr Pro
50 55 60

Pro Ala Thr Ser Ser Ala Thr Ser Ala Arg Pro Thr Ala Thr Ser Leu
65 70 75 80
<210> 11
<211> 92
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(92)

<400> 11
tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48
Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu
1 5 10 15

cgc gag aag ttc aac aac acc acc atc gtg ttc aac cag agc tc 92
Arg Glu Lys Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser
20 25 30



<210> 12
<211> 30
<212> PRT
<213> Artificial Sequence

<400> 12
Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu
1 5 10 15

Arg Glu Lys Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser
20 25 30
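
Each DNA entry in this listing is paired with a PRT entry that gives its translation. As a rough sketch, assuming the standard genetic code and reading from the first nucleotide, the pairing of SEQ ID NO: 11 with SEQ ID NO: 12 can be checked as follows. The names `CODON_TABLE`, `translate` and `SEQ_ID_11` are illustrative only; trailing nucleotides that do not complete a codon are ignored, and one-letter amino acid codes are used for brevity.

```python
from itertools import product

# Standard genetic code in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from position 1; '*' marks a stop codon."""
    seq = "".join(dna.lower().split())   # remove codon spacing
    usable = len(seq) - len(seq) % 3     # ignore an incomplete final codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# SEQ ID NO: 11 (92 nucleotides) as given in the listing.
SEQ_ID_11 = (
    "tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg "
    "cgc gag aag ttc aac aac acc acc atc gtg ttc aac cag agc tc"
)
protein = translate(SEQ_ID_11)
print(protein, len(protein))
```

The 30 residues obtained correspond to the length given under <211> for SEQ ID NO: 12.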

<210> 13
<211> 130
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(130)

<400> 13
gag ctc cgg cgg cga ccc cga gat cgt gat gca cag ctt caa ctg cgg 48
Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gln Leu Gln Leu Arg
1 5 10 15

cgg cga gtt ctt cta ctg caa cac cac cca gct gtt caa cag cac ctg 96
Arg Arg Val Leu Leu Leu Gln His His Pro Ala Val Gln Gln His Leu
20 25 30

gaa cga gac caa cag cga ggg caa cat cac tag t 130
Glu Arg Asp Gln Gln Arg Gly Gln His His *
35 40



<210> 14
<211> 42
<212> PRT
<213> Artificial Sequence

<400> 14
Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gln Leu Gln Leu Arg
1 5 10 15

Arg Arg Val Leu Leu Leu Gln His His Pro Ala Val Gln Gln His Leu
20 25 30

Glu Arg Asp Gln Gln Arg Gly Gln His His
35 40





<210> 15
<211> 164
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(164)

<400> 15
act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag atc atc aac 48
Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
1 5 10 15

atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc atc ggc ggc 96
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gly Gly
20 25 30

cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg acc cgc gac 144
Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
35 40 45

ggc ggc agc gac aac tcg ag 164
Gly Gly Ser Asp Asn Ser
50



<210> 16
<211> 54
<212> PRT
<213> Artificial Sequence

<400> 16
Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
1 5 10 15

Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gly Gly
20 25 30

Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
35 40 45

Gly Gly Ser Asp Asn Ser
50

<210> 17
<211> 200
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(200)

<400> 17
ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

cga gaa gcg cgc cgt ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

cgc tgc ag 200
Arg Cys
65



<210> 18
<211> 66
<212> PRT
<213> Artificial Sequence

<400> 18
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

Arg Cys
65

<210> 19
<211> 212
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(212)

<400> 19
ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

cga gaa gcg cgc cta ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192
Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

cgc tgc agc ccg ggg gat cc 212
Arg Cys Ser Pro Gly Asp
65 70



<210> 20
<211> 70
<212> PRT
<213> Artificial Sequence

<400> 20
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

Arg Cys Ser Pro Gly Asp
65 70



<210> 21
<211> 200
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(200)

<400> 21
ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

cga gaa gag cgc cgt ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192
Arg Glu Glu Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

cgc tgc ag 200
Arg Cys
65



<210> 22
<211> 66
<212> PRT
<213> Artificial Sequence

<400> 22
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala
1 5 10 15

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg
20 25 30

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala
35 40 45

Arg Glu Glu Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly
50 55 60

Arg Cys
65

<210> 23
<211> 178
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(178)

<400> 23
ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 48
Leu Gln Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cys Arg
1 5 10

ccc gcc agc tgc tga gcg gca tcg tgc agc agc aga aca acc tgc tgc 96
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys
15 20 25

gcg cca tcg agg ccc agc agc acc tgc tcc agc tga ccg tgt ggg gca 144
Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala
30 35 40

tca agc agc tcc agg ccc gcg tgc tgg ctc tag a 178
Ser Ser Ser Ser Arg Pro Ala Cys Trp Leu *
45 50



<210> 24
<211> 54
<212> PRT
<213> Artificial Sequence

<400> 24
Leu Gln Ala Ala Pro Trp Ala Pro Pro Ala Pro Pro Cys Arg Pro Ala
1 5 10 15

Ser Cys Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys Ala Pro Ser
20 25 30

Arg Pro Ser Ser Thr Cys Ser Ser Pro Cys Gly Ala Ser Ser Ser Ser
35 40 45

Arg Pro Ala Cys Trp Leu
50



<210> 25
<211> 178
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(178)

<400> 25
ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 48
Leu Gln Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cys Arg
1 5 10

ccc gcc agc tgc tga gcg gca tcg tgc agc agc aga aca acc tgc tgc 96
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys
15 20 25

gcg cca tcg agg ccc agc agc acc tgc tcc agc tga ccg tgt ggg gca 144
Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala
30 35 40

tca agc agt gct gcg gcc gcg tgc tgg ctc tag a 178
Ser Ser Ser Ala Ala Ala Ala Cys Trp Leu *
45 50



<210> 26
<211> 54
<212> PRT
<213> Artificial Sequence

<400> 26
Leu Gln Ala Ala Pro Trp Ala Pro Pro Ala Pro Pro Cys Arg Pro Ala
1 5 10 15

Ser Cys Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys Ala Pro Ser
20 25 30

Arg Pro Ser Ser Thr Cys Ser Ser Pro Cys Gly Ala Ser Ser Ser Ala
35 40 45

Ala Ala Ala Cys Trp Leu
50

<210> 27
<211> 77
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(77)

<400> 27
tct aga gcg cta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48
Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu
1 5 10 15

ctc cgg caa gct gat ctg cac cac ggc cg 77
Leu Arg Gln Ala Asp Leu His His Gly
20 25
<210> 28
<211> 25
<212> PRT
<213> Artificial Sequence

<400> 28
Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu
1 5 10 15

Leu Arg Gln Ala Asp Leu His His Gly
20 25



<210> 29
<211> 190
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(190)

<400> 29
cgg ccg tgc cct gga acg cca gct gga gca aca aga acc tga gcc aga 48
Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr * Ala Arg
1 5 10 15

ttt ggg aca aca tga cct gga tgg agt ggg agc gcg aga tca gca act 96
Phe Gly Thr Thr * Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr
20 25 30

aca ccg aga tca tct aca gcc tga tcg agg aga gcc aga acc agc agg 144
Thr Pro Arg Ser Ser Thr Ala * Ser Arg Arg Ala Arg Thr Ser Arg
35 40 45

aga aga acg agc tgg acc tgc tcc agc tgg aca agt ggg caa gct t 190
Arg Arg Thr Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gln Ala
50 55 60



<210> 30
<211> 60
<212> PRT
<213> Artificial Sequence

<400> 30
Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr Ala Arg Phe
1 5 10 15

Gly Thr Thr Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr Thr Pro
20 25 30

Arg Ser Ser Thr Ala Ser Arg Arg Ala Arg Thr Ser Arg Arg Arg Thr
35 40 45

Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gln Ala
50 55 60



<210> 31
<211> 177
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(177)

<400> 31
aag ctt gtg gaa ctg gtt caa cat cac caa ctg gct gtg gta cat caa 48
Lys Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln
1 5 10 15

gat ttt cat cat gat cgt ggg cgg cct gat cgg cct gcg cat cgt gtt 96
Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val
20 25 30

cac cgt gct gag cat cgt gaa ccg cgt gcg cca ggg cta cag ccc cct 144
His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gln Pro Pro
35 40 45

gag ctt cca gac ccg cct gcc cgt gcc ccg cgg 177
Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg
50 55



<210> 32
<211> 59
<212> PRT
<213> Artificial Sequence

<400> 32
Lys Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln
1 5 10 15

Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val
20 25 30

His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gln Pro Pro
35 40 45

Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg
50 55



<210> 33
<211> 140
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(140)

<400> 33
ccg cgg ccc cga ccg ccc cga ggg cat cga gga gga ggg cgg cga gcg 48
Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Gly Gly Arg Arg Ala
1 5 10 15

cga ccg cga ccg cag cac ccg cct ggt gac cgg ctt cct gcc cct gat 96
Arg Pro Arg Pro Gln His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp
20 25 30

ctg gga cga cct gcg cag cct gtt cct gtt cag cta cca tcg at 140
Leu Gly Arg Pro Ala Gln Pro Val Pro Val Gln Leu Pro Ser
35 40 45
<210> 34
<211> 46
<212> PRT
<213> Artificial Sequence

<400> 34
Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Gly Gly Arg Arg Ala
1 5 10 15

Arg Pro Arg Pro Gln His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp
20 25 30

Leu Gly Arg Pro Ala Gln Pro Val Pro Val Gln Leu Pro Ser
35 40 45

<210> 35
<211> 129
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(129)

<400> 35
atc gat tgc gcg acc tgc tgc tga tcg tgg ccc gca tcg tgg agc tgc 48
Ile Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys
1 5 10 15

tgg gcc ggc gcg gct ggg aga tcc tga agt act ggt gga acc tgc tcc 96
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser
20 25 30

agt act gga gcc agg agc tga aga act ctg cag 129
Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gln
35 40



<210> 36
<211> 40
<212> PRT
<213> Artificial Sequence

<400> 36
Ile Asp Cys Ala Thr Cys Cys Ser Trp Pro Ala Ser Trp Ser Cys Trp
1 5 10 15

Ala Gly Ala Ala Gly Arg Ser Ser Thr Gly Gly Thr Cys Ser Ser Thr
20 25 30

Gly Ala Arg Ser Arg Thr Leu Gln
35 40

<210> 37
<211> 114
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(114)

<400> 37
ctg cag tga gcc tgc tga acg cca ccg cca tcg ccg tgg ccg agg gca 48
Leu Gln * Ala Cys * Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala
1 5 10

ccg acc gcg tga tcg agg tgg tgc agc gca tct ggc gcg gca tcc tgc 96
Pro Thr Ala * Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys
15 20 25

aca tcc cca ccc gaa ttc 114
Thr Ser Pro Pro Glu Phe
30 35



<210> 38
<211> 35
<212> PRT
<213> Artificial Sequence

<400> 38
Leu Gln Ala Cys Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr
1 5 10 15

Ala Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro
20 25 30

Pro Glu Phe
35



<210> 39
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(41)

<400> 39
gaa ttc gcc agg gct tcg agc gcg ccc tgc tgt aag gat cc 41
Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp
1 5 10



<210> 40 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<400> 40 

Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
1 5 10 



<210> 41
<211> 506
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(506)

<400> 41
gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg at 506
Arg Leu Arg Ser Cys Asn Thr Ser
165



<210> 42
<211> 168
<212> PRT
<213> Artificial Sequence

<400> 42
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser
165

<210> 43
<211> 374
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(374)

<400> 43
tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48
Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu
1 5 10 15

cgc gag aag ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc 96
Arg Glu Lys Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly
20 25 30

ggc gac ccc gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc 144
Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe
35 40 45

ttc tac tgc aac acc acc cag ctg ttc aac agc acc tgg aac gag acc 192
Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr
50 55 60

aac agc gag ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc 240
Asn Ser Glu Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg
65 70 75 80

atc aag cag atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac 288
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
85 90 95

gcc ccc ccc atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc 336
Ala Pro Pro Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly
100 105 110

ctg ctg ctg acc cgc gac ggc ggc agc gac aac tcg ag 374
Leu Leu Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser
115 120
<210> 44
<211> 124
<212> PRT

<213> Artificial Sequence

<400> 44

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu
1 5 10 15

Arg Glu Lys Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly
20 25 30

Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe
35 40 45

Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr
50 55 60

Asn Ser Glu Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg
65 70 75 80

Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
85 90 95

Ala Pro Pro Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly
100 105 110

Leu Leu Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser
115 120


<210> 45 
<211> 1277 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1277) 

<400> 45 

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

acc cgc gac ggc ggc agc gac aac tcg ag 1277
Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425



<210> 46 

<211> 425 

<212> PRT 

<213> Artificial Sequence 





<400> 46

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425

<210> 47
<211> 1277
<212> DNA

<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(1277)

<400> 47

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag agc acc gac 336
Pro Leu Cys Val Thr Leu Gln Cys Thr Lys Leu Lys Gln Ser Thr Asp
100 105 110

acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Gln Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

acc cgc gac ggc ggc agc gac aac tcg ag 1277
Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425



<210> 48 
<211> 425 
<212> PRT 

<213> Artificial Sequence 



<400> 48 



Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Gln Cys Thr Lys Leu Lys Gln Ser Thr Asp
100 105 110

Thr Gln Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425


<210> 49
<211> 144
<212> PRT

<213> Artificial Sequence

<400> 49

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag agc acc gac 336
acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
cag atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
ttc tac agc ctg gac atc gtg ccc atc gac aac gac cag acc agc tac 480
cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
acc cgc gac ggc ggc agc gac aac tcg ag 1277

<210> 50 
<211> 425 
<212> PRT 

<213> Artificial Sequence 
<400> 50 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Gln Cys Thr Lys Leu Lys Gln Ser Thr Asp
100 105 110

Thr Gln Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Gln Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Gln Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425



<210> 51 
<211> 144 
<212> PRT 

<213> Artificial Sequence 



<400> 51 



gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
cag atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
ttc tac agc ctg gac atc gtg ccc atc gac aac gac cag acc agc tac 480
cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
acc cgc gac ggc ggc agc gac aac tcg ag 1277



<210> 52 
<211> 425 
<212> PRT 

<213> Artificial Sequence 



<400> 52 
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Gln Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Gln Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser
420 425



<210> 53 
<211> 432 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (432) 



<400> 53 

tct aga gcg cta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48
Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu
1 5 10 15

ctc cgg caa gct gat ctg cac cac ggc cgt gcc ctg gaa cgc cag ctg 96
Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu
20 25 30

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 144
Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly
35 40 45

gtg gga gcg cga gat cag caa cta cac cga gat cat cta cag cct gat 192
Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp
50 55 60

cga gga gag cca gaa cca gca gga gaa gaa cga gct gga cct gct cca 240
Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro
65 70 75 80

gct gga caa gtg ggc aag ctt gtg gaa ctg gtt caa cat cac caa ctg 288
Ala Gly Gln Val Gly Lys Leu Val Glu Leu Val Gln His His Gln Leu
85 90 95

gct gtg gta cat caa gat ttt cat cat gat cgt ggg cgg cct gat cgg 336
Ala Val Val His Gln Asp Phe His His Asp Arg Gly Arg Pro Asp Arg
100 105 110

cct gcg cat cgt gtt cac cgt gct gag cat cgt gaa ccg cgt gcg cca 384
Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro
115 120 125

ggg cta cag ccc cct gag ctt cca gac ccg cct gcc cgt gcc ccg cgg 432
Gly Leu Gln Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg
130 135 140



<210> 54 
<211> 144 
<212> PRT 
<213> Artificial Sequence 

<400> 54 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
1 5 10 15 

Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu 
20 25 30 

Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly 
35 40 45 

Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp 
50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gln Val Gly Lys Leu Val Glu Leu Val Gln His His Gln Leu 
85 90 95 

Ala Val Val His Gln Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 
115 120 125 

Gly Met Gln Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 140 

<210> 55 
<211> 434 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(434) 

<400> 55 

tct aga gcg cta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48 
Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
1 5 10 15 

ctc cgg caa gct gat ctg cac cac ggc cgt gcc ctg gaa cgc cag ctg 96 
Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu 
20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 144 
Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa cta cac cga gat cat cta cag cct gat 192 
Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp 
50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga gct gga cct gct cca 240 
Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

gct gga caa gtg ggc aag ctt gtg gaa ctg gtt caa cat cac caa ctg 288 
Ala Gly Gln Val Gly Lys Leu Val Glu Leu Val Gln His His Gln Leu 
85 90 95 

gct gtg gta cat caa gat ttt cat cat gat cgt ggg cgg cct gat cgg 336 
Ala Val Val His Gln Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 

cct gcg cat cgt gtt cac cgt gct gag cat cgt gaa ccg cgt gcg cca 384 
Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 
115 120 125 

ggg atg cag ccc cct gag ctt cca gac ccg cct gcc cgt gtg acg gat 432 
Gly Met Gln Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 140 

cc 434 

<210> 56 
<211> 144 
<212> PRT 
<213> Artificial Sequence 

<400> 56 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
1 5 10 15 

Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu 
20 25 30 

Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly 
35 40 45 

Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp 
50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gln Val Gly Lys Leu Val Glu Leu Val Gln His His Gln Leu 
85 90 95 

Ala Val Val His Gln Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 
115 120 125 

Gly Met Gln Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 140 

<210> 57 
<211> 281 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(281) 

<400> 57 

tct aga gcg cta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48 
Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
1 5 10 15 

ctc cgg caa gct gat ctg cac cac ggc cgt gcc ctg gaa cgc cag ctg 96 
Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu 
20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 144 
Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa cta cac cga gat cat cta cag cct gat 192 
Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp 
50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga gct gga cct gct cca 240 
Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

gct gga caa gtg ggc aag ctt gtg tga ctg att gag gat cc 281 
Ala Gly Gln Val Gly Lys Leu Val * Leu Ile Glu Asp 
85 90 

<210> 58 
<211> 92 
<212> PRT 
<213> Artificial Sequence 

<400> 58 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
1 5 10 15 

Leu Arg Gln Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gln Leu 
20 25 30 

Glu Gln Gln Glu Pro Glu Pro Asp Leu Gly Gln His Asp Leu Asp Gly 
35 40 45 

Val Gly Ala Arg Asp Gln Gln Leu His Arg Asp His Leu Gln Pro Asp 
50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gln Val Gly Lys Leu Val Leu Ile Glu Asp 
85 90 

<210> 59 
<211> 272 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(272) 

<400> 59 

atc gat tgc gcg acc tgc tgc tga tcg tgg ccc gca tcg tgg agc tgc 48 
Ile Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
1 5 10 15 

tgg gcc ggc gcg gct ggg aga tcc tga agt act ggt gga acc tgc tcc 96 
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 
20 25 30 

agt act gga gcc agg agc tga aga act ctg cag tga gcc tgc tga acg 144 
Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gln * Ala Cys * Thr 
35 40 

cca ccg cca tcg ccg tgg ccg agg gca ccg acc gcg tga tcg agg tgg 192 
Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr Ala * Ser Arg Trp 
45 50 55 

tgc agc gca tct ggc gcg gca tcc tgc aca tcc cca ccc gaa ttc gcc 240 
Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro Pro Glu Phe Ala 
60 65 70 

agg gct tcg agc gcg ccc tgc tgt aag gat cc 272 
Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
75 80 

<210> 60 
<211> 84 
<212> PRT 
<213> Artificial Sequence 

<400> 60 

Ile Asp Cys Ala Thr Cys Cys Ser Trp Pro Ala Ser Trp Ser Cys Trp 
1 5 10 15 

Ala Gly Ala Ala Gly Arg Ser Ser Thr Gly Gly Thr Cys Ser Ser Thr 
20 25 30 

Gly Ala Arg Ser Arg Thr Leu Gln Ala Cys Thr Pro Pro Pro Ser Pro 
35 40 45 

Trp Pro Arg Ala Pro Thr Ala Ser Arg Trp Cys Ser Ala Ser Gly Ala 
50 55 60 

Ala Ser Cys Thr Ser Pro Pro Glu Phe Ala Arg Ala Ser Ser Ala Pro 
65 70 75 80 

Cys Cys Lys Asp 

<210> 61 
<211> 798 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(798) 

<400> 61 

ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48 
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144 
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca gct gct gag cgg cat cgt gca gca gca gaa caa cct gct 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct gct cca gct gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca gct cca ggc ccg cgt gct ggc tct aga gcg cta cct cca 384 
His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg ctc cgg caa gct gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

cac cac ggc cgt gcc ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

caa cta cac cga gat cat cta cag cct gat cga gga gag cca gaa cca 576 
Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

gca gga gaa gaa cga gct gga cct gct cca gct gga caa gtg ggc aag 624 
Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

ctt gtg gaa ctg gtt caa cat cac caa ctg gct gtg gta cat caa gat 672 
Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln Asp 
210 215 220 

ttt cat cat gat cgt ggg cgg cct gat cgg cct gcg cat cgt gtt cac 720 
Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 
225 230 235 240 

cgt gct gag cat cgt gaa ccg cgt gcg cca ggg cta cag ccc cct gag 768 
Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gln Pro Pro Glu 
245 250 255 

ctt cca gac ccg cct gcc cgt gcc ccg cgg 798 
Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 

<210> 62 
<211> 266 
<212> PRT 
<213> Artificial Sequence 

<400> 62 

Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln Asp 
210 215 220 

Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 
225 230 235 240 

Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gln Pro Pro Glu 
245 250 255 

Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 

<210> 63 
<211> 800 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(800) 

<400> 63 

ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48 
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144 
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca gct gct gag cgg cat cgt gca gca gca gaa caa cct gct 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct gct cca gct gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca gct cca ggc ccg cgt gct ggc tct aga gcg cta cct cca 384 
His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg ctc cgg caa gct gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

cac cac ggc cgt gcc ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

caa cta cac cga gat cat cta cag cct gat cga gga gag cca gaa cca 576 
Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

gca gga gaa gaa cga gct gga cct gct cca gct gga caa gtg ggc aag 624 
Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

ctt gtg gaa ctg gtt caa cat cac caa ctg gct gtg gta cat caa gat 672 
Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln Asp 
210 215 220 

ttt cat cat gat cgt ggg cgg cct gat cgg cct gcg cat cgt gtt cac 720 
Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 
225 230 235 240 

cgt gct gag cat cgt gaa ccg cgt gcg cca ggg atg cag ccc cct gag 768 
Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gln Pro Pro Glu 
245 250 255 

ctt cca gac ccg cct gcc cgt gtg acg gat cc 800 
Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 

<210> 64 
<211> 266 
<212> PRT 
<213> Artificial Sequence 

<400> 64 

Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

Leu Val Glu Leu Val Gln His His Gln Leu Ala Val Val His Gln Asp 
210 215 220 

Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 
225 230 235 240 

Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gln Pro Pro Glu 
245 250 255 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 

<210> 65 
<211> 647 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(647) 

<400> 65 

ctc gag cag cgg caa gga gat ttt ccg ccc cgg cgg cgg cga cat gcg 48 
Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

cga caa ctg gcg cag cga gct gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144 
Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat cgg cgc tat gtt cct cgg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca gct gct gag cgg cat cgt gca gca gca gaa caa cct gct 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct gct cca gct gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca gct cca ggc ccg cgt gct ggc tct aga gcg cta cct cca 384 
His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg ctc cgg caa gct gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

cac cac ggc cgt gcc ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

caa cta cac cga gat cat cta cag cct gat cga gga gag cca gaa cca 576 
Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

gca gga gaa gaa cga gct gga cct gct cca gct gga caa gtg ggc aag 624 
Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

ctt gtg tga ctg att gag gat cc 647 
Leu Val * Leu Ile Glu Asp 
210 

<210> 66 
<211> 214 
<212> PRT 
<213> Artificial Sequence 

<400> 66 

Leu Glu Gln Arg Gln Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
1 5 10 15 

Arg Gln Leu Ala Gln Arg Ala Val Gln Val Gln Gly Gly Glu Asp Arg 
20 25 30 

Ala Pro Gly His Arg Pro His Gln Gly Gln Ala Pro Arg Gly Ala Ala 
35 40 45 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

Arg Cys Arg Gln His His Gly Arg Arg Gln Pro Asp Pro Asp Arg Ala 
65 70 75 80 

Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gln Pro Ala 
85 90 95 

Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

His Gln Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gln Ala Asp Leu 
130 135 140 

His His Gly Arg Ala Leu Glu Arg Gln Leu Glu Gln Gln Glu Pro Glu 
145 150 155 160 

Pro Asp Leu Gly Gln His Asp Leu Asp Gly Val Gly Ala Arg Asp Gln 
165 170 175 

Gln Leu His Arg Asp His Leu Gln Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gln Val Gly Lys 
195 200 205 

Leu Val Leu Ile Glu Asp 
210 

<210> 67 
<211> 1918 
<212> DNA 
<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(1918) 

<400> 67 

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
1 5 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp 
65 70 75 80 

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288 
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432 
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480 
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys 
165 170 175 

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576 
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val 
210 215 220 

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720 
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768 
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln 
245 250 255 

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816 
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864 
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912 
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008 
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro 
325 330 335 

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104 
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152 
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln 
370 375 380 

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200 
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248 
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu 
405 410 415 

acc cgc gac ggc ggc agc gac aac tcg agc agc ggc aag gag att ttc 1296 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe 
420 425 430 

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc agc gag ctg tac 1344 
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag atc gag ccc ctg ggc atc gcc ccc acc aag 1392 
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys 
450 455 460 

gcc aag cgc cgc gtg gtg cag cgc gag aag cgc gcc gtg ggc atc ggc 1440 
Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly 
465 470 475 480 

gct atg ttc ctc ggc ttc ctg ggc gct gca ggc agc acc atg ggc gcc 1488 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 

gcc agc ctg acc ctg acc gtg cag gcc cgc cag ctg ctg agc ggc atc 1536 
Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 
500 505 510 

gtg cag cag cag aac aac ctg ctg cgc gcc atc gag gcc cag cag cac 1584 
Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His 
515 520 525 

ctg ctc cag ctg acc gtg tgg ggc atc aag cag ctc cag gcc cgc gtg 1632 
Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val 
530 535 540 

ctg gct cta gag cgc tac ctc cag gac cag cgc ttc ctg ggc atg tgg 1680 
Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp 
545 550 555 560 

ggc tgc tcc ggc aag ctg atc tgc acc acg gcc gtg ccc tgg aac gcc 1728 
Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 575 

agc tgg agc aac aag aac ctg agc cag att tgg gac aac atg acc tgg 1776 
Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp 
580 585 590 

atg gag tgg gag cgc gag atc agc aac tac acc gag atc atc tac agc 1824 
Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser 
595 600 605 

ctg atc gag gag agc cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu 
610 615 620 

ctc cag ctg gac aag tgg gca agc ttg tgt gac tga ttg agg atc c 1918 
Leu Gln Leu Asp Lys Trp Ala Ser Leu Cys Asp * Leu Arg Ile 
625 630 635 

<210> 68 
<211> 638 
<212> PRT 

<213> Artificial Sequence 



<400> 68 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr Gly Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 




Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 






Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
465 470 475 480

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
485 490 495

Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
500 505 510

Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His
515 520 525

Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
530 535 540

Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp
545 550 555 560

Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala
565 570 575

Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp
580 585 590

Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser
595 600 605

Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu
610 615 620

Leu Gln Leu Asp Lys Trp Ala Ser Leu Cys Asp Leu Arg Ile
625 630 635

<210> 69
<211> 2071
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(2071)

<400> 69

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

acc cgc gac ggc ggc agc gac aac tcg agc agc ggc aag gag att ttc 1296
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc agc gag ctg tac 1344
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

aag tac aag gtg gtg aag atc gag ccc ctg ggc atc gcc ccc acc aag 1392
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

gcc aag cgc cgc gtg gtg cag cgc gag aag cgc gcc gtg ggc atc ggc 1440
Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
465 470 475 480

gct atg ttc ctc ggc ttc ctg ggc gct gca ggc agc acc atg ggc gcc 1488
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
485 490 495

gcc agc ctg acc ctg acc gtg cag gcc cgc cag ctg ctg agc ggc atc 1536
Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
500 505 510

gtg cag cag cag aac aac ctg ctg cgc gcc atc gag gcc cag cag cac 1584
Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His
515 520 525

ctg ctc cag ctg acc gtg tgg ggc atc aag cag ctc cag gcc cgc gtg 1632
Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
530 535 540

ctg gct cta gag cgc tac ctc cag gac cag cgc ttc ctg ggc atg tgg 1680
Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp
545 550 555 560

ggc tgc tcc ggc aag ctg atc tgc acc acg gcc gtg ccc tgg aac gcc 1728
Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala
565 570 575

agc tgg agc aac aag aac ctg agc cag att tgg gac aac atg acc tgg 1776
Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp
580 585 590

atg gag tgg gag cgc gag atc agc aac tac acc gag atc atc tac agc 1824
Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser
595 600 605

ctg atc gag gag agc cag aac cag cag gag aag aac gag ctg gac ctg 1872
Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu
610 615 620

ctc cag ctg gac aag tgg gca agc ttg tgg aac tgg ttc aac atc acc 1920
Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr
625 630 635 640

aac tgg ctg tgg tac atc aag att ttc atc atg atc gtg ggc ggc ctg 1968
Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu
645 650 655

atc ggc ctg cgc atc gtg ttc acc gtg ctg agc atc gtg aac cgc gtg 2016
Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn Arg Val
660 665 670

cgc cag gga tgc agc ccc ctg agc ttc cag acc cgc ctg ccc gtg tga 2064
Arg Gln Gly Cys Ser Pro Leu Ser Phe Gln Thr Arg Leu Pro Val *
675 680 685

cgg atc c 2071
Arg Ile



<210> 70 
<211> 689 
<212> PRT 

<213> Artificial Sequence 



<400> 70 



Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
465 470 475 480

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
485 490 495

Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
500 505 510

Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His
515 520 525

Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
530 535 540

Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp
545 550 555 560

Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala
565 570 575

Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp
580 585 590

Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser
595 600 605

Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu
610 615 620

Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr
625 630 635 640

Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu
645 650 655

Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn Arg Val
660 665 670

Arg Gln Gly Cys Ser Pro Leu Ser Phe Gln Thr Arg Leu Pro Val Arg
675 680 685

Ile



<210> 71
<211> 2469
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)...(2469)

<400> 71

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

acc cgc gac ggc ggc agc gac aac tcg agc agc ggc aag gag att ttc 1296
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc agc gag ctg tac 1344
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

aag tac aag gtg gtg aag atc gag ccc ctg ggc atc gcc ccc acc aag 1392
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

gcc aag cgc cgc gtg gtg cag cgc gag aag cgc gcc gtg ggc atc ggc 1440
Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
465 470 475 480

gct atg ttc ctc ggc ttc ctg ggc gct gca ggc agc acc atg ggc gcc 1488
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
485 490 495

gcc agc ctg acc ctg acc gtg cag gcc cgc cag ctg ctg agc ggc atc 1536
Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
500 505 510

gtg cag cag cag aac aac ctg ctg cgc gcc atc gag gcc cag cag cac 1584
Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His
515 520 525

ctg ctc cag ctg acc gtg tgg ggc atc aag cag ctc cag gcc cgc gtg 1632
Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
530 535 540

ctg gct cta gag cgc tac ctc cag gac cag cgc ttc ctg ggc atg tgg 1680
Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp
545 550 555 560

ggc tgc tcc ggc aag ctg atc tgc acc acg gcc gtg ccc tgg aac gcc 1728
Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala
565 570 575

agc tgg agc aac aag aac ctg agc cag att tgg gac aac atg acc tgg 1776
Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp
580 585 590

atg gag tgg gag cgc gag atc agc aac tac acc gag atc atc tac agc 1824
Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser
595 600 605

ctg atc gag gag agc cag aac cag cag gag aag aac gag ctg gac ctg 1872
Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu
610 615 620

ctc cag ctg gac aag tgg gca agc ttg tgg aac tgg ttc aac atc acc 1920
Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr
625 630 635 640

aac tgg ctg tgg tac atc aag att ttc atc atg atc gtg ggc ggc ctg 1968
Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu
645 650 655

atc ggc ctg cgc atc gtg ttc acc gtg ctg agc atc gtg aac cgc gtg 2016
Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn Arg Val
660 665 670

cgc cag ggc tac agc ccc ctg agc ttc cag acc cgc ctg ccc gtg ccc 2064
Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Arg Leu Pro Val Pro
675 680 685

cgc ggc ccc gac cgc ccc gag ggc atc gag gag gag ggc ggc gag cgc 2112
Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg
690 695 700

gac cgc gac cgc agc acc cgc ctg gtg acc ggc ttc ctg ccc ctg atc 2160
Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu Ile
705 710 715 720

tgg gac gac ctg cgc agc ctg ttc ctg ttc agc tac cat cga ttg cgc 2208
Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser Tyr His Arg Leu Arg
725 730 735

gac ctg ctg ctg atc gtg gcc cgc atc gtg gag ctg ctg ggc cgg cgc 2256
Asp Leu Leu Leu Ile Val Ala Arg Ile Val Glu Leu Leu Gly Arg Arg
740 745 750

ggc tgg gag atc ctg aag tac tgg tgg aac ctg ctc cag tac tgg agc 2304
Gly Trp Glu Ile Leu Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser
755 760 765

cag gag ctg aag aac tct gca gtg agc ctg ctg aac gcc acc gcc atc 2352
Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile
770 775 780

gcc gtg gcc gag ggc acc gac cgc gtg atc gag gtg gtg cag cgc atc 2400
Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Val Gln Arg Ile
785 790 795 800

tgg cgc ggc atc ctg cac atc ccc acc cga att cgc cag ggc ttc gag 2448
Trp Arg Gly Ile Leu His Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu
805 810 815

cgc gcc ctg ctg taa gga tcc 2469
Arg Ala Leu Leu * Gly Ser
820


<210> 72
<211> 822
<212> PRT
<213> Artificial Sequence

<400> 72

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
465 470 475 480

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
485 490 495

Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
500 505 510

Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His
515 520 525

Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val
530 535 540

Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg Phe Leu Gly Met Trp
545 550 555 560

Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala
565 570 575

Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp Asp Asn Met Thr Trp
580 585 590

Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Ser
595 600 605

Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Leu Asp Leu
610 615 620

Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr
625 630 635 640

Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu
645 650 655

Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn Arg Val
660 665 670

Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Arg Leu Pro Val Pro
675 680 685

Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg
690 695 700

Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu Ile
705 710 715 720

Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser Tyr His Arg Leu Arg
725 730 735

Asp Leu Leu Leu Ile Val Ala Arg Ile Val Glu Leu Leu Gly Arg Arg
740 745 750

Gly Trp Glu Ile Leu Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser
755 760 765

Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile
770 775 780

Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Val Gln Arg Ile
785 790 795 800

Trp Arg Gly Ile Leu His Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu
805 810 815

Arg Ala Leu Leu Gly Ser
820

<210> 73 
<211> 1431 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)...(1431)

<400> 73

gct agc gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc agc gac gcc aag 96
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

atc atc agc ctg tgg gac cag agc ctg aag ccc tgc gtg aag ctg acc 288
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac agc acc gac 336
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp
100 105 110

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc agc ttc 384
Thr Asn Asn Thr Arg Trp Gly Thr Gln Glu Met Lys Asn Cys Ser Phe
115 120 125

aac atc agc acc agc gtg cgc aac aag atg aag cgc gag tac gcc ctg 432
Asn Ile Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu
130 135 140

ttc tac agc ctg gac atc gtg ccc atc gac aac gac aac acc agc tac 480
Phe Tyr Ser Leu Asp Ile Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160

cgc ctg cgc agc tgc aac aca tcg atc atc acc cag gcc tgc ccc aag 528
Arg Leu Arg Ser Cys Asn Thr Ser Ile Ile Thr Gln Ala Cys Pro Lys
165 170 175

gtg agc ttc gag ccc atc ccc atc cac ttc tgc gcc ccc gcc ggc ttc 576
Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

gcc atc ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

acc aac gtg agc acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

agc acc cag ctg ctg ctg aac ggc agc ctg gcc gag gag gag gtg gtg 720
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

atc aga tct gag aac ttc acc aac aac gcc aag acc atc atc gtg cag 768
Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

ctg aac gag agc gtg gag atc aac tgc acc cgc ccc aac aac aac acc 816
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

cgc aag agc atc cac atc ggc cct ggc cgc gcc ttc tac acc acc ggc 864
Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

gac atc atc ggc gac atc cgc cag gcc cac tgc aac atc tct aga acc 912
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

ttc aac aac acc acc atc gtg ttc aac cag agc tcc ggc ggc gac ccc 1008
Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

gag atc gtg atg cac agc ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

aac acc acc cag ctg ttc aac agc acc tgg aac gag acc aac agc gag 1104
Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

ggc aac atc act agt ggc acc atc acc ctg ccc tgc cgc atc aag cag 1152
Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

atc atc aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

atc ggc ggc cag atc aag tgc ctg agc aac atc acc ggc ctg ctg ctg 1248
Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

acc cgc gac ggc ggc agc gac aac tcg agc agc ggc aag gag att ttc 1296
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc agc gag ctg tac 1344
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

aag tac aag gtg gtg aag atc gag ccc ctg ggc atc gcc ccc acc aag 1392
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

gcc aag cgc cgc gtg gtg cag cgc gag aag cgc gcc tag 1431
Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
465 470 475





<210> 74
<211> 476
<212> PRT
<213> Artificial Sequence

<400> 74








Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45

Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80

Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lvs 


Asn 


Ser 


Thr 


A ^rv 
no 








100 










105 










110 




Thr 


Asn 


Asn 


Thr 


Arg 




Gly Thr Gin Glu 


Met 


Lvs 


Asn 


Cys 


Ser 


Phe 






115 










X U 










125 






Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


rto 1 1 


Jjys 


Mot- 


Lvs 


Arg 


Glu 


Tvr 


Ala 


Leu 




130 










135 










140 








Phe 


Tyr 


Ser 


Leu 


Asp 


lit: 


val 


Pro 

t 1U 


lie 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 












1 SO 

1 Ju 










155 










IDU 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Th r 


Q<ar 

OCi 


1 J.C 


T 1 o 

lie 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










171; 

1 / z> 


Val Ser Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe
180 185 190

Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
195 200 205

Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240

Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln
245 250 255

Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270

Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285

Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr
290 295 300

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys
305 310 315 320

Phe Asn Asn Thr Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro
325 330 335

Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350

Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu
355 360 365

Gly Asn Ile Thr Ser Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln
370 375 380

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400

Ile Gly Gly Gln Ile Lys Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu Ile Phe
420 425 430

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
435 440 445

Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
450 455 460

Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
465 470 475













<210> 75 
<211> 1038 
<212> DNA 

<213> Artificial Sequence 






<220> 

<221> CDS 

<222> (1)...(1038)

<400> 75

gcc gtg ggc atc ggc gct atg ttc ctc ggc ttc ctg ggc gct gca ggc 48
Ala Val Gly Ile Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly
1 5 10 15

agc acc atg ggc gcc gcc agc ctg acc ctg acc gtg cag gcc cgc cag 96
Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln
20 25 30

ctg ctg agc ggc atc gtg cag cag cag aac aac ctg ctg cgc gcc atc 144
Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile
35 40 45

gag gcc cag cag cac ctg ctc cag ctg acc gtg tgg ggc atc aag cag 192
Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln
50 55 60

ctc cag gcc cgc gtg ctg gct cta gag cgc tac ctc cag gac cag cgc 240
Leu Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg
65 70 75 80

ttc ctg ggc atg tgg ggc tgc tcc ggc aag ctg atc tgc acc acg gcc 288
Phe Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala
85 90 95

gtg ccc tgg aac gcc agc tgg agc aac aag aac ctg agc cag att tgg 336
Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp
100 105 110

gac aac atg acc tgg atg gag tgg gag cgc gag atc agc aac tac acc 384
Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr
115 120 125

gag atc atc tac agc ctg atc gag gag agc cag aac cag cag gag aag 432
Glu Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys
130 135 140

aac gag ctg gac ctg ctc cag ctg gac aag tgg gca agc ttg tgg aac 480
Asn Glu Leu Asp Leu Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn
145 150 155 160

tgg ttc aac atc acc aac tgg ctg tgg tac atc aag att ttc atc atg 528
Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met
165 170 175

atc gtg ggc ggc ctg atc ggc ctg cgc atc gtg ttc acc gtg ctg agc 576
Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser
180 185 190

atc gtg aac cgc gtg cgc cag ggc tac agc ccc ctg agc ttc cag acc 624
Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr
195 200 205

cgc ctg ccc gtg ccc cgc ggc ccc gac cgc ccc gag ggc atc gag gag 672
Arg Leu Pro Val Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu
210 215 220

gag ggc ggc gag cgc gac cgc gac cgc agc acc cgc ctg gtg acc ggc 720
Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly
225 230 235 240

ttc ctg ccc ctg atc tgg gac gac ctg cgc agc ctg ttc ctg ttc agc 768
Phe Leu Pro Leu Ile Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser
245 250 255

tac cat cga ttg cgc gac ctg ctg ctg atc gtg gcc cgc atc gtg gag 816
Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Ala Arg Ile Val Glu
260 265 270

ctg ctg ggc cgg cgc ggc tgg gag atc ctg aag tac tgg tgg aac ctg 864
Leu Leu Gly Arg Arg Gly Trp Glu Ile Leu Lys Tyr Trp Trp Asn Leu
275 280 285

ctc cag tac tgg agc cag gag ctg aag aac tct gca gtg agc ctg ctg 912
Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu
290 295 300

aac gcc acc gcc atc gcc gtg gcc gag ggc acc gac cgc gtg atc gag 960
Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu
305 310 315 320

gtg gtg cag cgc atc tgg cgc ggc atc ctg cac atc ccc acc cga att 1008
Val Val Gln Arg Ile Trp Arg Gly Ile Leu His Ile Pro Thr Arg Ile
325 330 335

cgc cag ggc ttc gag cgc gcc ctg ctg taa 1038
Arg Gln Gly Phe Glu Arg Ala Leu Leu *
340 345



<210> 76 
<211> 345 
<212> PRT 

<213> Artificial Sequence 



<400> 76 



Ala Val Gly Ile Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly
1 5 10 15

Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Gln
20 25 30

Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile
35 40 45

Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln
50 55 60

Leu Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Arg
65 70 75 80

Phe Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala
85 90 95

Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Asn Leu Ser Gln Ile Trp
100 105 110

Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr
115 120 125

Glu Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys
130 135 140

Asn Glu Leu Asp Leu Leu Gln Leu Asp Lys Trp Ala Ser Leu Trp Asn
145 150 155 160

Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met
165 170 175

Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser
180 185 190

Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr
195 200 205

Arg Leu Pro Val Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu
210 215 220

Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly
225 230 235 240

Phe Leu Pro Leu Ile Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser
245 250 255

Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Ala Arg Ile Val Glu
260 265 270

Leu Leu Gly Arg Arg Gly Trp Glu Ile Leu Lys Tyr Trp Trp Asn Leu
275 280 285

Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu
290 295 300

Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu
305 310 315 320

Val Val Gln Arg Ile Trp Arg Gly Ile Leu His Ile Pro Thr Arg Ile
325 330 335

Arg Gln Gly Phe Glu Arg Ala Leu Leu
340 345



